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Preface 

When  this  study  was  first  formulated,  it  was  hoped 
that  it  could  examine  aircraft  of  several  types  built  by 
several  manufacturers.  It  was  felt  that  the  results  of  a 
broad  study  would  have  extensive  application  within  the  air 
force  and  the  airframe  industry.  It  was  a  disappointment  to 
find  only  one  manufacturer  willing  to  provide  information 
necessary  to  support  this  study.  Even  though  he  requires 
anonymity,  without  his  information  this  study  would  have  been 
impossible,  and  I  wish  to  express  my  thanks  for  his  coopera¬ 
tion  and  my  hopes  that  he  will  find  the  ideas  contained  in 
this  study  useful. 

I  am  also  deaply  grateful  to  the  faculty  of  the  Sys¬ 
tems  Management  Department  of  the  School  of  Engineering  in 
the  Air  Force  Institute  of  Technology  for  allowing  me  to  de¬ 
velop  this  study  in  my  own  way.  Though  it  is  not  as  learned 
as  it  would  be  if  each  step  were  directed  by  a  professor,  the 
results  appear  useful,  and  I  have  benefited  greatly  from  the 
nearly  ideal  learning  situation  they  have  created.  More  di¬ 
rect  involvement  on  the  faculty's  part  could  have  reduced  the 
number  of  dead  ends  pursued  and  tangents  explored,  but  this 
would  have  been  at  the  expense  of  my  profiting  irom  my  own 
.  mistakes. 

Roger  M.  Smith 
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Definitions 

AMPR  Weight.  The  empty  weight  of  the  aircraft  less 
engines,  wheels,  tires,  fuel  calls,  electronic  equipment,  in¬ 
struments,  and  other  equipment  as  defined  in  the  Aeronautical 
Manufacturers*  Planning  Report  (Ref  28:4-6). 

Critical  Point.  If  the  laboi  costs  for  each  aircraft 
are  plotted  against  some  performance  or  engineering  variable, 
one  point  will  appear  more  discontinuous  than  the  others. 

This  discontinuity  might  be  small  or  large.  The  aircraft  re¬ 
presenting  the  point  will  vary  depending  upon  the  specific 
variable  against  which  labor  costs  are  being  compared.  For 
any  particular  variable,  the  most  discontinuous  point  is 
designated  the  critical  point. 

Labor  Costs.  Labor  costs  will  be  used  as  a  general 
term  to  refer  to  a  quantity  of  labor  resource  that  must  be 
expended  to  perform  a  task.  Whether  the  quantity  is  measured 
in  hours,  hours  per  pound,  or  dollars  is  unimportant.  All  of 
these  units  of  measure,  and  others r  can  be  used  to  describe 
the  same  labor  expenditure. 

Labor  Hours.  The  number  of  direct  labor  manhours  re¬ 
quired  to  manufacture  an  airframe  and  to  install  equipment 
necessary  to  transform  the  airframe  into  a  flyable  aircraft. 
This  study  uses  without  modification  the  detailed  definition 
of  direct  labor  hours  used  in  the  accounting  system  of  the 
manufacturer  who  supplied  the  data  for  this  study.  This  defi 
nition  conforms  generally  to  the  definition  of  direct  labor 
hours  given  in  Asher  (Ref  3:48-50). 
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Maximum  Gross  Weight.  An  aircraft's  maximum  allowable 
gross  weight  for  take-off  for  normal  operation  under  standard 
atmospheric  conditions. 

Maximum  Speed.  The  maximum  true  airspeed  in  knots 
attainable  at  any  altitude  regardless  of  whether  the  limiting 
factor  is  lack  of  additional  power  or  structural  limitations. 
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H  A  general  symbol  for  the  total  number  of  hours  required 
to  manufacture  the  first  unit*  H  is  found  by  multiply¬ 
ing  a  specific  Y  fey  its  corresponding  A&PR  weight. 

H.j  Y1  multiplied  by  AHPR  weight. 

Y„  multiplied  by  AMPR  weight. 

cL 

r  Learning  fraction.  A  number  between  zero  and  one  which 
gives  the  rate  of  reduction  in  labor  requirements  that 
occur  in  each  ten  year  period. 

R2  Coefficient  of  determination. 

p 

Rf  The  partial  coefficient  of  determination  for  the  ith 
independent  variable  in  the  equation  being  examined. 

T  A  general  symbol  for  time. 

Texp  Exponential  time.  A  time  scale  found  by  subtracting 
some  base  year  from  the  start  production  dates  of  all 
the  aircraft  in  the  study.  The  name  exponential  is 
used  to  distinguish  this  time  scale  and  its  use  from 
straight  line  time  below,  Yfaen  this  time  scale  is  used 
in  log  regressions,  the  resulting  regression  coeffi¬ 
cients  are  exponents  after  the  inverse  logarithmic 
transformation,  hence  the  name.  Y/hen  a  specific  value 
of  the  exponent  is  being  considered,  the  term  "exp" 
will  be  replaced  by  a  number,  i.e.,  T~*^ 


is  the  time 
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scale  with  each  of  its  terns  raised  to  the  -.3  power. 

Tp  Straight  line  time.  A  time  adjustment  factor  that  will 
reduce  a  quantity  to  a  certain  fraction  of  its  original 
value  for  each  ten  years  being  considered.  When  a  spe¬ 
cific  learning  fraction  is  "being  considered,  the  learn¬ 
ing  fraction  ”r"  will  "be  replaced  with  a  number,  he., 

T  0.  See  chapter  V,  page  49. 

T  0  Straight  line  time  adjustment  factor  using  r  =  .8  and 

base  year  equal  to  the  start  production  date  of  the  last 
aircraft  in  the  sample.  See  chapter  V,  page  50. 

V  Maximum  speed. 

W&  AMPR  weight. 

W  Maximum  gross  weight. 

o 

Ytg_a  Maximum  gross  weight  minus  AMPR  weight. 

Y  The  symbol  Y  will  be  used  a3  a  general  symbol  to  repre¬ 
sent  a  number  in  direct  labor  hours  per  AMPR  pound  of 
aircraft  weight.  The  number  will  alwsys  be  obtained 
from  the  regression  equation  that  best  fits  the  program 
in  question.  When  subscripted,  it  will  represent  the 
number  of  hours  per  pound  to  produce  the  cumulative 
unit  designated  by  the  subscript,  (i.e.,  yiqo  re:fer8 
to  the  value  given  by  the  regression  line  to  produce 
the  100th  unit) 
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Y.  The  parameter  of  the  learning  curve  equation  found  by 
regressing  the  average  labor  hours  per  pouna  p°r  air¬ 
craft  in  all  lots  produced  before  unit  100  on  the  cumu¬ 
lative  lot  mid  points.  When  the  program  in  question 
had  a  prototype,  the  prototype  lot  was  excluded  and 
counting  began  with  the  first  lot  after  the  prototype. 

Y  The  parameter  of  the  learning  curve  equation  found  in 
the  same  manner  as  Y^  except  an  adjustment  factor  of 
from  0.0  to  1.0  is  added  to  all  cumulative  lot  mid 
point 8  before  regression.  See  chapter  III. 

Y  ,  The  value  of  Ya  found  by  extrapolating  from  lot  1  mid 

a—  1  a 

point  to  the  Y  axis  using  the  slope  of  the  Y&  learning 
curve  equation. 

Y*  A  value  in  hours  per  AMPR  pound  for  the  first  unit  labor 
costs  found  by  inserting  the  independent  variables  for 
the  aircraft  in  question  in  an  estimating  equation  and 
solving  for  Y.  Y*  is  an  equation  produced  estimate  of 
either  Y^  or  Y&. 

Y*-Y  The  deviation  between  equation  produced  estimate  of  Y 
and  the  actual  value  of  Y  used  to  form  the  estimating 
equation. 
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Abstract 

This  study  examines  the  first  unit  labor  cost  para^ 
meter  of  the  learning  curve  equations  belonging  to  seven  dif¬ 
ferent  aircraft.  The  purpose  of  this  investigation  is  to 
find  a  method  which  will  produce  good  estimates  of  this 
parameter  prior  to  the  start  of  production  of  an  aircraft. 

It  was  found  that  simple  linear,  and  log  linear,  mul¬ 
tiple  variable  relations  could  not  provide  an  accurate  esti¬ 
mate.  However,  non-linear  functions  of  weight  and  time  were 
able  to  estimate  historical  data  within  4^  of  the  actual 
value.  It  is  concluded  that  equations  of  this  form  should 
lead  to  very  accurate  estimates  of  labor  cost. 
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AN  ANALYSIS  OP  FIRST  UNIT 
LABOR  COSTS  FOR  FIXED 
WING  AIRCRAFT 

I.  Introduction 

This  analysis  concerns  estimating  the  total  labor 
costs  to  manufacture  the  first  airframe  of  a  particular  air¬ 
craft  type;  i.e.,  the  total  labor  costs  for  the  first  B-52A 
produced.  It  does  not  concern  itself  with  the  costs  of 
changing  from  one  series  to  another;  i.e.,  from  the  B-52A  to 
B-52B.  Neither  does  it  investigate  specifically  the  costs  to 
produce  units  after  the  first  one  or  the  relation  of  these 
later  units  to  the  first  one  produced.  The  term  labor  costs 

i 

in  this  analysis  refers  to  the  number  of  direct  labor  manu¬ 
facturing  hours  required  to  build  the  first  aircraft  produced 
and  not  the  dollar  value  of  these  hours.  This  analysis 
examines  seven  aircraft  produced  by  a  single  manufacturer. 

The  general  mission  for  which  these  aircraft  were  designed 
and  their  size  are  similar.  Since  this  constitutes  only  a 
small  segment  of  the  total  airframe  manufacturing  industry, 
it  would  be  unwise  to  apply  the  results  of  this  study  indis¬ 
criminately  without  first  verifying  that  the  results  apply  to 
the  aircraft  in  question. 

Background 

People  in  the  business  of  cost  estimating  have  long 
known  that  the  estimates  they  provide  are  not  likely  to  be 
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very  accurate  unless  the  cost  estimate  concerns  something 
that  has  been  done  several  times  before.  In  fact,  if  they 
are  estimating  the  cost  of  a  new  structure,  particularly  one 
that  is  not  an  extension  of  a  previous  structure,  they  rea¬ 
lize  that  a  three  or  four  fold  error  in  their  estimate  is  not 
unlikely.  Summers  (Ref  37:12-15)  cites  several  examples  of 
cost  estimates  given  for  the  construction  of  canals,  rail¬ 
roads  and  nuclear  power  stations.  Early  estimates  were  off 
by  a  factor  of  from  two  to  ten.  The  earlier  in  the  program 
life  the  estimates  v/ere  made,  the  poorer  they  tended  to  be. 
Original  estimates  are  almost  always  on  the  low  side.  In  ex¬ 
amining  68  cost  estimates  on  22  major  military  programs,  he 
found  the  estimates  to  be  from  15$  to  150$  of  actual  program 
costs.  80$  of  the  estimates  examined  were  low.  Actual  costs 
are  likely  to  be  three  times  estimated  costs  if  the  estimate 
was  made  early  in  the  program  and  the  program  was  technologi¬ 
cally  difficult.  The  need  to  re-examine  old  estimating  pro¬ 
cedures  and  to  search  for  new  and  better  methods  is  clear. 

Nearly  all  estimates  for  labor  costs  in  the  airframe 
industry  center  around  the  use  of  the  log  linear  direct  labor 
learning  curve.  There  are  several  varieties  of  this  curve 
and  explanations  of  their  differences  can  be  found  in  many 
sources.  The  most  frequently  cited  and  detailed  source  is 
Asher  (Ref  3i 15-63 ).  Another  good  source,  though  not  so  de¬ 
tailed  is  Brewer  (Ref  7:43-66).  The  essential  feature  of  all 
these  curves  is  that  the  number  of  labor  hours  actually  re¬ 
quired  to  assemble  an  airframe  follows  a  nearly  straight  line 
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when  plotted  on  log  log  graph  paper  against  the  cumulative 
number  of  units  produced*  The  reason  why  labor  hours  react 
in  this  manner  is  not  clear.  Nevertheless,  equations  can  be 
developed,  after  the  fact,  that  match  almost  perfectly  the 
labor  hours  required  for  programs  already  completed  These 
equations  differ  from  program  to  program,  but  they  are  all  of 
the  same  form: 

Y  =  AXb  (1) 

where 

Y  is  the  number  of  hours  required  to  produce 
the  Xth  unit  (or  the  number  of  hours  per 
AMPR  pound  of  airframe  weight) 

X  is  the  cumulative  unit  produced  number 
A  is  a  parameter  whose  value  is  equal  to  the 
number  of  hours  (or  hours  per  pound)  to 
produce  the  first  aircraft 
b  is  a  parameter,  unique  to  each  individual 
program  usually  referred  to  as  the  slope 

It  should  be  clear  that  if  the  parameters  A  and  b  can 
be  predicted  accurately  in  advance  of  starting  production, 
then  labor  costs  could  also  be  predicted  very  accurately. 

But  predicting  A  and  b  is  where  the  difficulty  lies.  It 
might  be  useful  to  consider  a  realistic  example  in  order  to 
develop  some  appreciation  for  the  size  of  errors  that  might 
be  introduced  by  only  moderate  prediction  errors. 

Suppose  that  the  direct  labor  hours  required  for  the 
first  unit  was  estimated  at  15  hours  per  pound  of  AMPR  weight 
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prior  to  the  start  of  production.  Suppose  also  that  the 
learning  curve  for  the  proposed  aircraft  was  estimated  to  have 
a  74#  slope.  After  production,  it  is  observed  that  the  first 
unit  estimate  was  only  60#  of  the  actual  first  unit  costs.  It 
is  also  observed  that  the  slope  estimate  was  exactly  correct. 
These  figures  are  ficticious,  but  they  represent  a  very  rea¬ 
listic  description  of  what  has  often  happened.  In  this  case, 
actual  labor  costs  to  produce  the  first  100  aircraft  would  be 
173#  above  the  costs  first  estimated.  But  labor  costs  are  a 
major  cost  element  in  the  first  100  aircraft  produced.  So  a 
first  unit  estimating  error,  that  could  be  viewed  as  accep¬ 
table  in  the  contexx  of  a  historical  examination  of  estimating 
errors,  has  led  to  a  major  error  in  the  estimated  costs  for 
the  first  100  aircraft  produced. 

The  estimating  method  just  described  illustrates  one 
method  of  estimating  labor  cosfcfa  for  a  proposed  program;  i.e., 
estimating  A  (the  hrs/lb  for  unit  one)  and  b  (the  learning 
curve  slope).  An  alternative  method  (Ref  13s  IV- 1,  IV-2) 
would  be  to  estimate  the  hours  per  pound  for  two  or  more  units 
and  then  to  solve  algebraically  for  the  slope.  When  this 
method  is  used,  the  points  estimated  are  usually  other  than 
the  first  unit,  so  the  parameter  A  mast  also  be  determined. 

It  is  also  common  to  estimate  the  labor  costs  for  unit  100  and 
the  learning  curve  slope  and  then  to  solve  for  the  parameter 
A  (Ref  28:46-^3).  With  both  of  these  later  methods,  rela¬ 
tively  minor  errors  in  the  estimates  used  can  cause  major  er¬ 
rors  in  the  resulting  values  of  parameter  A.  No  method  is 


* 


4 


GSA/SM/70-17 


presently  available  to  put  reasonable  bounds  on  the  values 
that  could  be  assigned  to  A  other  than  intuition. 

The  common  way  to  apply  intuition  has  been  to  examine 
historical  records  for  aircraft  of  similar  size  and  mission. 
The  first  unit  labor  costs  from  these  historical  records  are 
adjusted,  according  to  methods  that  vary  from  company  to  com¬ 
pany,  to  produce  an  estimate  for  unit  one  of  the  new  program. 
The  chief  cause  of  estimating  errors  is  the  small  number  of 
aircraft  that  are  of  similar  size  and  mission  to  the  one  being 
proposed.  Rational  adjustments  applied  to  only  one  or  two 
previously  produced  aircraft  can  easily  produce  gross  estima¬ 
ting  errors.  Air  Force  Negotiation  Memorandum  and  Contract 
Proposals  contain  miuiy  examples  of  such  estimates  that  were 
made  in  a  seemingly  rational  way  befo:  e  the  program  was 
started,  and  that,  after  the  fact,  were  greatly  in  error. 

One  objective  of  this  analysis  is  to  provide  some  limits  on 
what  might  ba  considered  a  good  estimate  of  the  parameter  A. 

The  Problem 

The  problem  to  be  investigated  in  this  analysis,  then, 
may  be  summarized  in  this  way.  What  techniques  and  mathema¬ 
tical  relations  may  be  used  to  estimate  the  labor  costs  to 
produce  the  first  airframe  of  a  new  type  of  aircraft?  What 
reasonable  bounds  exist  for  such  estimates?  The  objective  of 
this  research  is  to  demonstrate  that  mathematical  relations 
may  be  developed  to  accurately  predict  the  values  for  the 
labor  cost  of  the  first  airframe  produced.  Since  this  study 
uses  data  from  only  one  manufacturer  and  on  only  one  type  of 
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aircraft,  it  is  unlikely  that  the  relations  developed  here 
will  apply  to  the  entire  fixed  wing  airframe  industry. 

The  Analytical  Approach 

The  analytical  approach  presented  in  this  study  is 
flexible.  It  is  not  restricted  by  preconceived  ideas  as  to 
the  functional  form  resulting  estimating  equations  must  have. 
The  form  is  determined  by  the  data  available  for  analysis  and 
certain  underlying  assumptions.  The  chief  assumption  is  that 
the  labor  costs  are  related  in  some  identifiable  way  to  some 
or  all  of  the  following  factors: 

1.  Weight,  speed,  size,  power,  or  some  other 
engineering  factor  that  is  qur  Tfcifiable. 

2.  Management  and/or  engineering  competence. 

3.  Experience  gained  through  the  production  of 
aircraft  that  are  similar  to  the  one  being  proposed. 

4.  Technology,  in  the  sense  that  as  time  passes 
a  given  task  becomes  easier  to  do  when  new  knowledge  is  ap¬ 
plied  to  the  performance  of  the  task. 

Variations  in  the  way  time  and  effort  and 
materials  are  used  in  the  research,  development  and  planning 
that  precede  the  production  of  a  r.ew  aircraft. 

It  is  also  assumed  that  these  factors  may  be  arranged 
in  some  meaningful  mathematical  way  to  predict  labor  costs. 

As  many  combinations  of  these  factors  will  be  tested  as  the 
available  data  will  allow. 

One  factor,  that  of  management  and  engineering  compe¬ 
tence,  must  be  abandoned  at  the  outset.  Sines  only  one 
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manufacturer  was  willing  to  provide  detailed  information 
about  his  manufacturing  methods,  it  is  impossible  to  compare 
these  methods  to  those  of  another  manufacturer.  Without  such 
comparison,  few  valid  conclusions  can  be  made* 

This  study  describes  a  search  for  a  mathematical  re¬ 
lation  that  will  predict  labor  costs  for  aircraft  of  all 
sizes  produced  in  any  time  period.  The  intuitive  guide  used 
in  formulating  and  investigating  various  relationships  can  be 
stated  as  follows:  Labor  costs  should  be  related  in  some  way 
to  the  engineering  complexity  of  the  aircraft  in  question. 
Engineering  complexity  might  be  expressed  mathematically  as  a 
function  of  weight,  speed,  thrust,  payload-  or  other  perfor¬ 
mance  factors  taken  singly  or  in  combination.  When  a  suitable 
statement  of  engineering  complexity  is  selected,  it  must  be 
adjusted  for  time  before  comparisons  can  be  made  between  air¬ 
craft  that  were  not  produced  at  the  same  time.  The  point 
here  is  that  some  mathematical  function  of  time  can  be  used 
to  account  for  differences  in  the  state  of  the  art  when  air¬ 
craft  are  produced  in  different  years.  When  a  suitable  com¬ 
bination  of  engineering  complexity  and  time  is  selected  and 
historical  data  is  examined  in  relation  to  this  selected  ex¬ 
pression,  the  labor  costs  for  the  production  of  certain  air¬ 
craft  will  seem  unusually  high  and  others  will  seem  unusually 
low.  Where  this  trend  is  consistent,  the  variation  might  be 
reasonably  explained  by  differences  in  the  way  resources  were 
used  in  research  and  development  effort  for  the  aircraft  in 
question,  or  in  the  way  resources  were  used  in  the  planning 

for  the  production  of,  the  aircraft. 
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These  ideas  provided  the  basis  for  gathering  data  and 
selecting  variables  for  the  graphic  and  mathematical  analysis 
that  follows.  Before  turning  to  these  subjects,  it  might  be 
well  to  outline  some  of  the  fundamental  properties  of  the  di¬ 
rect  labor  3  earning  curve  since  much  of  the  data  used  in  this 
analysis  was  obtained  directly  from  these  curves. 
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II.  Preliminary  Relations 

This  study  makes  much  use  of  a  few  symbols  with  spe¬ 
cific  and  rather  narrow  meanings.  Additionally,  much  of  the 
data  analyzed  is  obtained  through  constructing  the  direct  la¬ 
bor  learning  curves  for  specific  aircraft  and  extracting  data 
from  the  equations  for  these  learning  curves.  For  these  rea¬ 
sons,  it  is  desirable  tc  discuss  briefly  the  fundamental 
mathematical  relations  of  the  learning  curve  and  how  learning 
curves  are  obtained. 

The  Learning  Curve 

It  is  a  frequently  observed  fact  in  the  airframe  in¬ 
dustry,  ar.d  other  indue  tries  as  well,  that  the  number  of 
hours  required  to  perform  a  task  react  in  a  very  predictable 
way  when  the  task  is  repeated  many  times.  If  it  takes  10 
hours  to  perform  a  task  after  it  has  been  done  several  times 
and  learning  is  occurring,  then  it  will  only  take  seme  frac¬ 
tion  of  10  hours  to  do  the  task  when  it  has  been  performed 
twice  as  many  times.  For  example,  if  it  takes  10  hours  to  do 
a  task  the  tenth  time  it  is  performed,  and  learning  is  occur¬ 
ring  at  a  80$  rate,  then  it  will  take  only  eight  hours  to 
perform  the  task  on  the  20th  repetition  and  6.4  hours  the 
40th  time.  The  time  required  would  continue  to  be  reduced  to 
0.8  of  its  starting  value  each  time  the  number  of  repetitions 
is  doubled.  This  relationship  can  be  expressed  by  the  mathe¬ 
matical  equation 

Y  =  AXb 
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Y  is  the  number  of  hours  required  tc  perform 
the  task 

X  is  the  number  of  times  the  task  has  been 
performed 

A  is  the  time  to  perform  the  task  the  first 
time  (21  in  this  example) 
b  is  the  arithmetic  slope  of  the  learning 
curve  equation  (-.32  in  this  example) 

If  either  the  equation,  or  the  actual  observed  values, 
for  Y  and  X  were  to  be  plotted  on  regular  rectangular  coordi¬ 
nate  graph  paper  the  results  would  look  like  Fig.  1. 


Fig.  1 .  Y  =  AXb 

If  equation  (2)  or  the  actual  observed  values  of  Y  and 
X  were  to  be  plotted  on  log  log  graph  paper,  the  results  would 
be  quite  different.  The  curve  would  appear  as  a  straight  line 
similar  to  the  graph  shown  in  Fig.  2.  The  same  straight  line 
can  be  produced  in  another  way.  If  equation  (2)  is  trans¬ 
formed  by  taking  the  logarithm  of  both  sides,  the  resulting 

equation  is  Log(Y)  =  Log(A)  +  bLog(X)  (3) 
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Pig.  2.  Y  =  AXb 
(on  log  log  graph  paper) 

If  equation  (3)  is  plotted  on  normal  rectangular  graph  paper, 
the  results  would  "be  the  same  as  shown  in  Pig.  2.  Plot  tin-  a 
multiplicative  equation  on  log  log  paper  then  is  equivalent 
to  plotting  the  log  of  the  equation  on  regular  paper.  Plot¬ 
ting  on  log  log  paper  is  very  much  simpler  since  it  allows 
plotting  the  original  Y  and  X  values  without  looking  up  the 
logarithms  first.  If  one  is  only  concerned  with  the  shape  of 
the  logarithmic  graph,  then  log  log  paper  is  very  useful. 

The  cnaracteristic  of  the  equation  Y  =  AXb  that  makes 
it  appear  as  a  straight  line  on  log  log  paper  is  the  value  of 
the  parameter  b.  As  long  as  it  is  a  constant  (i.e.,  it  does 
not  change  with  changing  values  of  X)  then  the  log  log  plot 
will  be  a  straight  line.  If,  when  the  observed  values  of  Y 
and  X  are  plotted  on  log  log  paper,  the  result  is  not  a 
straight,  line,  then  the  equation  Y  =  AXb  with  b  equal  to  a 
constant  is  not  an  appropriate  model  for  the  relationship 
being  investigated. 
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In  the  airframe  industry,  the  number  of  direct  labor 
hours  required  to  assemble  an  aircraft  are  not  recorded  for 
each  aircraft  that  comes  off  the  assembly  line.  Instead,  the 
hours  are  recorded  by  lots  of  varying  size.  What  must  be 
done,  if  the  learning  curve  is  to  be  plotted,  is  to  find  the 
average  number  of  hours  required  to  assemble  each  aircraft  in 
the  lot  (i.e.,  divide  the  total  number  of  hours  for  the  lot 
by  the  number  of  aircraft  in  the  lot).  The  lot  mid  point  must 
also  be  calculated.  The  deseed  value  is  the  total  number  of 
aircraft  produced  up  to  and  including  the  lot  mid  point.  It 
is  calculated  in  this  manner.  If  the  mid  point  for  lot  num¬ 
ber  4  is  desired  and  there  were  30  aircraft  prodviced  before 
lot  4  was  produced  and  there  are  10  aircraft  in  lot  4 ,  then 
the  cumulative  number  for  the  mid  point  of  lot  4  would  be 
found  by  adding  one  half  of  the  size  of  lot  4  (5)  to  the  last 
cumulative  number  of  the  previous  lot  (30)  for  a  cumulative 
lot  mid  point  number  of  35. 

One  further  adjustment  is  usually  made  before  the 
curve  is  plotted.  When  it  is  intended  to  compare  one  air¬ 
craft  learning  curve  to  another,  the  direct  labor  hours  are 
usually  divided  by  the  AMPR  weight  of  the  aircraft  being  pro¬ 
duced.  It  is  this  learning  curve  in  hours  per  pound  plotted 
against  the  cumulative  units  produced  that  is  usually  used  when 
comparing  two  or  more  different  aircraft. 

Identifying  the  parameters  of  the  equation  that  best 
describes  the  plotted  learning  curve  is  usually  done  by  com¬ 
puter.  The  hours  per  pound  and  the  corresponding  cumulative 
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lot  aid  points  are  converted  to  their  logarithmic  values  and 
a  simple  linear  regression  is  then  performed  cn  these  loga¬ 
rithmic  values.  The  equations  that  are  used  to  perform  this 
regression  are  not  relevant  to  this  paper.  They  can  he  found 
in  any  good  statistics  or  econometrics  text  book.  Johnston 
(Ref  24:3-24)  is  an  outstanding  text  as  is  Goldberger  (Ref 
18} <  Fisher  (Ref  16:38-49)  treats  an  example  of  logarithmic 
regression  analysis  an  considerable  detail. 

The  results  of  the  regression  will  describe  the  para¬ 
meters  A  and  b  in  either  one  of  two  ways  or  both.  If  the  re¬ 
sults  are  from  a  simple  logarithmic  regression  program,  then 
the  intercept  coefficient  trill  be  Log  (A)  and  the  slope  coef¬ 
ficient  will  be  the  arithmetic  value  v?  b.  If  the  program 
used  has  been  tailored  for  learning  curve  analysis,  then  A 
will  be  expressed  in  the  actual  value  of  hours  per  pound  for 
the  first  unit  assembled  and  b  will  be  in  a  percentage  de¬ 
scribing  the  rate  of  learning.  The  relation  between  A  and 
Log(A)  can  be  found  in  a  logarithm  table.  The  relation  be¬ 
tween  the  percent  of  learning  and  the  arithmetic  slope  (b) 
can  be  found  from  the  equation: 

log  .  b  .  Log(2)  (4) 

One  additional  relation  is  worth  pointing  out.  That 
is  the  relation  between  the  costs  to  produce  first  unit  and 
the  parameter  A  of  equation  Y  =  AX  .  It  has  been  noticed  em¬ 
pirically  that  actual  learning  in  the  airframe  industry  can 
be  closely  matched  after  the  fact  by  equations  of  this  form. 
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The  differences  in  equation  valuee  and  actual  observed  values 
are  usually  minor,  but  these  values  are  not  the  same «  If  Y  = 
AX^3  is  solved  for  the  first  unit  labor  costs  per  pound  (i.e«, 
X  =  1)  we  find  that  Y  -  A  is  this  cost.  This  either  may  or 
may  not  be  the  actual  labor  cost  for  the  first  unit  produced. 
Since  labor  costs  are  gathered  by  lot  it  would  be  nearly  im¬ 
possible  to  determine  if  they  were  the  same  or  not.  But  it 
is  really  not  important  whether  first  unit  labor  costs  is 
equal  to  A  or  not.  We  are  really  interested  in  the  parameter 
A  of  the  equation  that  best  describes  the  labor  costs  of  the 
whole  program.  The  fact  that  this  equation  can  be  solved  for 
an  estimated  value  of  first  unit  coats  does  not  mean  that  ac¬ 
tual  first  unit  costs  would  be  the  value  of  A  that  would  best 
predict  labor  costs  for  the  whole  program.  The  analysis  that 
follows  will  be  focused  on  ways  to  estimate,  in  advance  of 
actual- production,  the  values  that  A  will  most  likely  have. 

It  is  only  in  this  sense  that  this  study  is  an  analysis  of 
first  unit  labor  costs. 
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III .  The  Data  and  Its  Adjustment 

A  considerable  amount  of  data  in  many  forms  was  re¬ 
viewed  for  appropriateness  to  this  analysis*  Most  of  the 
data  was  not  in  a  form  that  would  allow  the  analysis  to  be 
performed  on  it  directly.  It  was  necessary  to  make  several 
adjustments  in  order  to  have  the  various  programs  comparable 
and  to  transform  other  information  into  a  form  that  could  be 
treated  mathematically.  The  following  paragraphs  outline  the 
data  that  was  selected  and  how  it  was  adjusted  before  the 
analysis  was  made.  Figures  that  are  provided  have  been  al¬ 
tered  to  protect  privileged  information.  They  do,  however, 
retain  the  essential  characteristics  of  the  actual  data. 

Direct  Labor  Data 

Direct  labor  is  one  of  the  major  expenditure  classi¬ 
fications  used  in  the  airframe  manufacturing  industry.  Other 
major  expenditure  classifications  are  Engineering  hours, 
Tooling  hours,  Materials,  Overhead  and  General  Administration. 
There  is  no  standard  way  to  classify  expenditures  in  the  air¬ 
frame  industry.  The  classifications  vary  in  name  and  content 
from  firm  to  firm.  They  are  similar  however  and  no  serious 
problem  would  be  encountered  in  transforming  the  accounting 
classifications  of  two  different  firms  into  comparable 
figures.  Since  all  the  data  used  in  this  analysis  was  pro¬ 
vided  by  one  manufacturer,  no  alteration  wes  necessary. 

Since  the  data  gathered  is  privileged  information, 
its  source  and  the  aircraft  they  represent  will  not  be 
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identified.  The  aircraft  will  be  designated  by  arbitrary 
numbers,  1  to  7,  assigned  randomly. 

The  direct  labor  hour  figures  gathered  represent  those 
hours  expended  by  workmen  in  machining,  processing,  fabrica¬ 
ting  and  assembling  the  integral  parts  of  the  airframe  struc¬ 
ture.  Asher  (Ref  3:49)  expands  and  treats  this  definition  in 
considerable  detail.  Another  classification  is  presented  by 
Dei  Rossi  (Ref  13:11-2,  II-6).  The  classification  used  by 
the  manufacturer  providing  the  data  is  consistent  with  these 
definitions.  The  data  was  available  in  both  hour  amounts  and 
dollar  amounts.  Only  hour  amounts  were  collected  to  eliminate 
the  requirement  to  consider  changing  wage  and  price  rates. 

As  mentioned  previously,  the  hour  figures  were  not  re- 
coided  by  individual  aircraft.  Instead,  they  were  recorded 
by  consecutive  groups  of  aircraft  or  lots.  These  lot  sizes 
varied  from  two  to  30  with  the  larger  lot  sizes  usually  occur¬ 
ring  later  in  the  production  schedule.  By  dividing  the  total 
number  of  hours  required  to  produce  the  lot  by  the  lot  size, 
the  average  number  of  hours  per  aircraft  in  the  lot  was  ob¬ 
tained  , 

Information  on  all  lots  produced  was  not  gathered. 

Since  the  analysis  is  concerned  only  with  the  characteristics 
of  the  first  portion  of  the  production  run,  unit  100  was  se¬ 
lected  as  a  cut  off  point.  Lot  information  containing  aircraft 
produced  after  unit  100  was  not  collected. 

To  make  the  hour  figures  for  large  aircraft  comparable 
to  the  hour  figures  of  smaller  aircraft,  all  hour  figures 
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were  divided  by  the  AMPR  weight  of  the  aircraft  concerned. 

But  the  total  AMPR  weight  of  the  aircraft  could  not  be  used 
because  there  was  considerable  variation  in  the  amount  of  the 
aircraft  that  was  actually  produced  and  assembled  in  the 
plant.  What  was  used  was  a  figure  called  in-plant  AMPR 
weight.  It  represents  the  /JflPR  weight  of  that  portion  of  the 
aircraft  that  was  actually  produced  and  assembled  in  the 
plant.  The  resulting  hours  per  in-plant  AMPR  pound  then  de- 
sacribes  or  approximates  a  figure  that  would  have  been  rea¬ 
lized  if  the  entire  aircraft  had  been  produced  and  assembled 
in  the  plant. 

There  i6  one  more  series  of  adjustments  necessary  be¬ 
fore  the  learning  curves  for  the  seven  aircraft  can  be  con¬ 
structed  and  compared.  Labor  hour  data  for  several  of  the 
aircraft  began  with  two  prototype  aircraft.  This  was  parti¬ 
cularly  true  of  the  older  programs.  But  prototype  production 
is  considerably  different  from  the  production  of  the  first 
two  aircraft  of  a  production  run.  Prototypes  are  built  in 
response  to  a  company  idea  that  a  particular  type  of  aircraft 
might  be  salable  or  in  response  to  a  request  from  a  potential 
buyer.  In  either  case  prototype  construction  is  essentially 
a  one  time  construction  of  an  aircraft  to  see  if  the  design 
is  feasible  and  salable.  It  is  recognized  that  certain  bene¬ 
fits  may  be  realized  that  vri.ll  make  future  production  easier, 
but  prototype  construction  may  precede  actual  production  by 
years  and  it  is  more  nearly  a  one  time  construction  project 
rather  than  the  start  of  a  production  run. 
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Where  prototype  production  was  included  in  the  direct 
labor  data  it  was  removed  and  counting  of  cumulative  units 
began  with  the  first  aircraft  of  the  normal  production  run. 
Average  direct  labor  hours  per  in-plant  AMPR  poun\  per  air¬ 
craft  wa3  calculated  and  regressed  on  the  cumulative  lot  mid 
points  to  produce  the  estimated  first  unit  labor  cost  desig¬ 
nated  Y.j . 

It  was  felt  that  an  alternative  first  unit  labor  cost 
reflecting  what  might  have  been  learned  during  prototype  con¬ 
struction  would  also  be  useful.  There  is  no  recognised  method 
for  adjusting  learning  curves  for  learning  that  occurs  prior 
to  the  beginning  of  production  so  the  following  method  was  de¬ 
vised.  Reasoning,  as  above,  that  the  production  of  two  pro¬ 
totypes  does  not  produce  the  same  learning  as  the  production 
of  the  first  two  aircraft  of  a  production  run,  it  must  pro¬ 
duce  learning  equivalent  to  some  fraction  of  two  production 
aircraft.  But  since  prototype  programs  vary  in  the  amount  of 
leo.d  time  before  production  is  started,  can  be  significantly 
different  in  design  from  the  production  aircraft,  and  can  be 
produced  at  a  different  facility,  it  is  not  reasonable  to  as¬ 
sign  the  same  fraction  of  two  to  all  prototype  programs. 
Considering  these  factors,  and  some  others  of  a  privileged 
nature,  each  prototype  program  was  viewed  separately  and  an 
intuitive  judgment  was  made  as  to  the  amount  of  pre-production 
learning  it  could  have  produced. 

All  of  the  aircraft  programs  were  then  considered  to¬ 
gether  and  ranked  according  to  the  amount  of  pre-production 
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learning  their  prototype  programs  could  have  produced.  ?or 
example,  an  aircraft  program  whose  prototype  was  produced  in 
response  to  a  buyer  request  and  where  prototype  construction 
and  production  occurred  at  different  facilities  would  be 
ranked  below  a  company  conceived  prototype  produced  in  the 
production  facility. 

This  ranking  was  then  converted  to  a  number  between  0 
and  1  and  added  to  the  cumulative  lot  mid  points  obtained 
earlier.  Zero  was  assigned  to  those  programs  that  did  not 
have  a  prototype  program.  The  higher  numbers  v/ere  assigned 
to  those  programs  where  a  maximum  of  pre-production  learning 
might  have  occurred.  The  earlier  calculated  average  hours 
per  pound  per  aircraft  was  then  regressed  on  the  new  lot  mid 
points.  The  first  unit  labor  coots  from  these  adjusted 
curves  was  designated  Y_, 

ct 

A  summary  of  the  adjustments  and  the  values  of  Y- ,  Y_, 

I  a 

and  Y„  .  are  shown  in  Table  I.  The  values  have  been  altered 
to  protect  privileged  information,  but  they  still  retain  their 
essential  relationships.  The  direct  labor  learning  curves 
with  prototype  information  included  and  no  adjustments  made 
are  shown  in  Fig.  3.  For  comparison,  Fig.  4  shows  the  learn¬ 
ing  curves  after  the  prototype  information  has  been  removed 
and  the  above  adjustments  riade.  Notice  that  the  curves  in 
Fig,  4  have  ,nuch  more  similarity  in  slope  and  intercept  than 
those  in  Fig.  3.  One  convention  will  be  adopted  at  this 
point.  All  future  graphs,  as  well  as  Figs.  3  and  4,  will  be 
on  log  log  coordinate  graph  paper  unless  specifically  noted 


otherwise. 
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TABLE  I 

ESTIMATED  FIRST  UNIT  LABOR  COSTS 


Aircraft 

Adjustment 

Factor 

Ya 

Ya-1 

1 

25.87 

.75 

28.49 

28.49 

2 

21.93 

.75 

24.71 

24.71 

3 

17.12 

.75 

19.77 

18.75 

4 

19. CO 

.50 

20.28 

25.42 

5 

23.39 

.25 

24.45 

24.65 

6 

20.22 

0.00 

20.22 

20.90 

7 

27.31 

0.00 

27.31 

27.31 

Note:  Symbols  defined  on  page  xi. 
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Fig,  3.  Direct  Labor  Learning  Curves 
(from  unadjusted  raw  data) 


ect 


GSA/SM/70-17 


Two  additional  pieces  of  information  are  taken  from 
the  curves  that  produced  Ya>  The  first  is  the  cur/e  value 
for  the  100th  unit  produced  and  it  is  designated  Y1Q0.  The 
second  is  found  by  extrapolating  from  the  first  let  mid  point 
at  its  adjusted  value,  to  the  Y  axis  using  the  slope  of  the 

Y  learning  curve.  This  value  could  be  considered  a  better 
approximation  of  what  the  actual  first  unit  costs  would  be 
than  either  Y^  or  Y&.  This  extrapolated  value  is  designated 

Y  „  and  is  also  shown  in  Table  I. 
a— 1 

Performance  Data 

It  was  originally  intended  to  gather  as  many  variaoles 
as  possible  that  could,  in  any  way,  be  considered  as  a  mea¬ 
sure  of  engineering  complexity  for  the  aircraft  in  question. 
Data  such  as  the  number  of  design  studies,  the  number  of  en¬ 
gineering  drawings,  the  number  of  parts,  airspeeds  that  re¬ 
flect  structural  limitations  at  low  and  high  altitude,  power 
ratios  and  any  others  that  might  be  available  were  sought. 

Some  aircraft  under  consideration  had  all  of  these  variables 
available.  Most  though,  had  only  one  or  two.  A  choice  had  to 
be  made  between  severely  limiting  the  number  of  aircraft  being 
studied  or  accepting  only  a  few  simple  performance  factors  as 
measures  of  engineering  difficulty.  The  latter  choice  was  made. 

Weight.  Three  weight  figures  were  obtainable  for  all 
of  the  aircraft  under  consideration,  maximum  gross  weight  for 
take-off,  AMPR  weight  and  the  difference  between  the  two. 

The  difference  could  be  viewed  as  the  amount  of  weight  the 
AMPR  airframe  is  designed  to  carry.  Of  the  three  weights, 
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the  maximum  gross  weight  for  take-off  is  known  earlier  in  the 
aircraft  program  life  than  the  AMPR  weight.  Unless  the  AMPR 
weight  or  the  weight  difference  proves  to  be  a  significantly 
better  variable  in  estimating  Y,  maximum  gross  weight  will  be 
used  in  any  equations  that  might  be  developed. 

Speed.  There  were  many  speeds  to  choose  from  but  few 
of  these  were  arrived  at  in  a  consistent  manner  from  aircraft 
to  aircraft.  There  seemed  to  be  an  entirely  unique  set  of 
speeds  available  for  each  aircraft,  and  the  meaning  of  each 
set  could  only  be  understood  in  the  context  of  its  own  pro¬ 
gram.  It  would  have  been  necessary  to  have  considerable  time 
and  the  services  of  an  aeronautical  engineer  to  make  the 
speeds  comparable.  Neither  was  available  so  the  two  that 
seemed  most  consistent  were  selected,  maximum  true  airspeed 
at  altitude,  and  maximum  true  air  speed  at  sea  level.  Y/here 
these  were  in  indicated  air  speed  or  miles  per  hour  they  were 
converted  to  knots  true  air  speed,  later  examination  showed 
that  the  maximum  speed  at  sea  level  could  not  be  used  to  dis¬ 
tinguish  one  aircraft  from  another  in  any  meaningful  way  and 
it  was  discarded. 

Time 

The  year  and  month  was  recorded  when  the  first  expen¬ 
diture  occurred  in  the  manufacture  of  the  first  components  of 
the  first  lot  for  each  of  the  seven  aircraft  considered. 

This  date  will  be  used  to  represent  the  state  of  the  techno¬ 
logical  art  when  construction  of  the  aircraft  began.  However, 
calendar  year  without  some  modification  is  not  suitable  for 
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analytic  purposes.  Two  types  of  modification  vail  be  used, 
exponential  time  and  straight  line  time.  The  underlying  idea 
in  both  of  these  approaches  to  the  vase  of  time  i3  the  same* 

If  a  task  takes  Y  number  of  hours  to  do  today,  it  can  be  ex¬ 
pected  that  with  technological  advances  it  should  take  only 
some  fraction  of  Y  hours  to  do  the  same  task  ten  years  from 
now.  There  are  some  tasks  that  never  change  and,  of  course, 
this  approach  would  not  apply  to  them.  But  looking  at  an  in¬ 
dustry  as  a  whole,  or  looking  at  the  aggregation  of  many 
tasks,  such  a  reduction  in  the  required  hours  is  bound  to  oc¬ 
cur.  The  question  of  whether  the  saving  is  actually  realized, 
or  just  diverted  to  newly  created  task  will  not  be  investi¬ 
gated  directly.  Time  appears  as  a  significant  variable  in 
some  of  the  estimating  relations  developed  in  this  study. 

Its  importance  and  the  role  it  plays  will  be  examined  after 
the  relations  are  developed. 

Exponential  Time.  If  we  are  considering  two  points 
in  time  and  the  number  of  hours  to  do  a  task  at  each  of  these 
times,  an  exponential  time  approach  would  say  that  the  number 
of  hours  required  at  the  later  time  point  is  proportional  to 
the  time  difference  between  the  two  points  raised  to  a  nega¬ 
tive  power  between  zero  and  one.  That  is: 

Y2  a  Y1 *Texp  (5) 

where: 

Yg  =  the  number  of  hours  to  do  the  task  at  the 
later  time  point 
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Y.j  =  the  number  of  hours  to  do  the  task  at  the 
earlier  time  point 

T  =  the  number  of  years  between  the  two  point:; 
exp  =  a  negative  number  between  zero  and  one 

An  essential  feature  of  exponential  time  is  that  its 
effect  is  not  constant  over  different  time  intervals  of  the 
same  length.  This  can  be  seen  in  Pig.  5  which  approximates 
the  curve  of  T~*^*  The  points  a,  b,  and  c  are  spaced  an  equal 
distance  apart.  The  base  year  is  x  •  If  it  takes  Y  hours  to 
do  a  task  at  time  point  a,  Y  will  be  reduced  by  a  factor  AY 


when  the  task  is  performed  at  time  point  b.  After  another 
increment  of  time  of  the  same  length  the  task  becomes  reduced 

I  t 

by  an  increment  AY  .  AY  is  smaller  in  absolute  value  and  in 
percent  of  reduction  than  AY.  If  several  events  are  consi¬ 
dered  together  relative  to  one  po;.nt  in  time,  then  the  base 
year  becomes  very  important  since  it  will  determine  if  the 
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events  occur  during  the  steep  portion  of  the  curve  or  the 
relatively  flat  portion.  It  should  be  clear  that  the  lai'gest 
reductions  will  occur  in  the  first  portion  of  the  curve  and 
relatively  little  adjustment  is  made  jn  the  later  portion  of 
the  curve.  If  the  events  being  considered  do  not  relate  in 
this  way,  then  the  choice  of  exponential  time  as  adjustment 
factor  would  not  be  good. 

Straight  Line  Time.  An  alternative  to  the  exponential 
time  concept  that  provides  for  a  relatively  constant  reduction 
across  all  intervals  is  straight  line  time.  A  rate  of  reduc¬ 
tion  is  selected  that  is  to  be  applied  every  10  years,  say 
75$.  Then  for  every  hour  required  in  year  s;ero,  only  .75 
hours  will  be  required  ten  years  hence  and  ,?52  in  twenty 
years.  Time  treated  in  this  manner  will  be  denoted  T  ^  with 
the  subscript  denoting  the  rate  of  reduction.  Straight  line 
time  is  not  actually  a  straight  line,  but  it  is  so  nearly  so 
that  no  serious  distortion  is  created  in  the  25  year  time 
period  covered  by  this  analysis.  Both  T~’2^  and  T  n(-  are 

•  \J 

plotted  in  Pig.  6  across  twenty-five  years  to  show  the  dif¬ 
ference  in  the  way  time  reductions  amounting  to  nearly  the 
same  quantity  are  applied.  Table  II  shows  several  other 
pairs  of  exponential  and  straight  line  time  for  comparison. 

Research  and  Development  Data 

Because  of  the  privileged  nature  of  this  data,  it 
will  not  be  identified  or  discussed  except  to  mention  a  few 
of  its  characteristics.  This  data  was  collected  in  four 
categories  and  when  identification  is  necessary  these 
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5  10  15  20  26 

YEARS 

Fig.  6. 

Adjustment  Factor  vs  Time  for  T  75  and  T“'2^ 
(Rectangular  Coordinates) 
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TABLE  II 

COMPARISON  OF  EXPONENTIAL  AND  STRAIGHT  LINE  TIME 


Years 

<£-•15 

*.85 

ij—  *25 

T.75 

y— » 3 

T.65 

5 

.786 

.922 

.669 

.866 

.617 

.806 

10 

.708 

.850 

.562 

.75 

.501 

.65 

15 

.666 

.784 

.508 

.649 

.444 

.524 

20 

.638 

.723 

.473 

.563 

.407 

.423 

25 

.617 

.666 

.447 

.487 

.381 

.341 

29 


GSA/SM/70-17 


categories  will  be  referred  to  as  R&D  A  through  R&D  D,  They 
are  related  in  a  way  that  percentage  figures  are  meaningful 
and  they  can  be  normalized  by  dividing  by  AMPR  weight.  In 
other  words,  R&D  A  per  AMPH  pound  would  have  meaning  and  it 
could  be  plotted  if  desired. 

Subcontracting  Data 

Data  was  gathered  on  the  amount  of  subcontracting 
that  occurred  during  the  production  of  the  first  100  of  each 
of  the  aircraft  considered.  The  trend  was  for  the  amount  of 
subcontracting  to  increase  in  recent  years.  The  amount  of 
subcontracting  varied  considerably  depending  upon  production 
rate,  time  constraints  and  how  full  or  slack  the  aerospace 
industry  was  at  the  time  of  production.  Company  officials 
felt  that  subcontracting  had  both  good  points  and  bad.  Where 
competent  aerospace  industries  exist  that  specialize  in  one 
particular  type  of  component,  say  auxiliary  power  plants,  the 
experience  and  expertise  of  such  a  company  may  allow  them  to 
do  a  job  better  and  for  less  money  than  the  prime  contractor. 
However,  where  inexperienced  contractors  are  involved,  the 
experience  is  mixed.  Their  lack  of  experience  often  finds 
them  over  committed  or  performing  a  job  for  which  they  greatly 
underestimated  the  costs  aid  complexities.  In  this  case, 
things  go  badly.  The  overall  impression  is  that  if  subcon- 
tracting  is  used  wisely it  is  at  worse  neutral  and  more  pro¬ 
bably  a  benefit. 

This  completes  the  discussion  of  the  data  gathered 
for  this  study  and  how  it  was  modified  for  analysis.  The 
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next  chapter  presents  the  analytical  tools  that  will  he  used 
to  perform  the  analysis. 
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IV.  Analytical  Methods 

Three  general  mathematical  tools  will  be  used  to  ana¬ 
lyze  the  data  just  described.  They  are  graphic  analysis, 
multiple  regression  analysis,  and  curve  fitting  by  least 
squares.  The  purpose  of  this  chapter  is  to  explain  how  these 
tools  will  be  used  and  what  criteria  will  be  used  in  judging 
the  results. 

In  all  three  methods,  the  main  test  of  any  relation¬ 
ship  developed  will  be  its  ability  to  predict.  For  every  set 
of  variables  considered,  there  is  at  least  one  aircraft  pro¬ 
gram  whose  labor  cost  presents  a  problem  because  the  data  re¬ 
presenting  the  program  appear  out  of  agreement  when  viewed  in 
context  with  the  others.  The  aircraft  that  presents  the  pro¬ 
blem  varies  depending  on  the  particular  set  of  variables 
being  considered,  and  all  aircraft  become  a  problem  candidate 
at  one  time  or  another.  The  existence  of  these  problem  points 
provides  an  ideal  way  to  check  the  soundness  of  any  estimating 
relation  that  might  be  developed. 

Estimating  relations  are  evaluated  by  many  techniques 
most  of  which  are  related  to  some  measure  of  how  well  the  es¬ 
timating  relation  reproduces  the  historical  points  upon  which 
it  is  based.  If  estimating  relations  in  this  study  were  de¬ 
veloped  using  all  seven  data  points  the  resulting  relations 
would  produce  better  estimates  of  the  problem  points  than  if 
the  relations  were  developed  with  the;  problem  point  excluded 
and  only  the  six  remrining  points  used.  A  relation  developed 
without  the  problem  point  included  in  the  data  set  may  or  may 
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not  produce  a  good  estimate  of  the  problem  point.  If  it  pro¬ 
duces  a  poor  estimate,  then  there  is  good  reason  to  question 
the  validity  of  the  relation  even  if  it  produces  a  fairly 
good  estimate  when  the  problem  point  is  added  to  the  data  set 
and  the  relation  is  redeveloped. 

The  questions  raised  by  the  inability  of  a  relation 
to  accurately  point  toward  a  problem  point  are  important.  It 
could  mean  that  the  variables  chosen  for  use  in  the  estima¬ 
ting  relation  are  inappropriate  or  that  they  are  used  in  an 
inappropriate  functional  form.  It  could  also  mean  that  the 
aircraft  represented  by  the  problem  point  requires  a  differ¬ 
ent  technology  for  its  production  and  is  not  logically  re¬ 
lated  to  the  other  aircraft  in  the  data  set.  It  could  mean 
that  the  labor  costs  for  the  problem  point  aircraft  were  ad¬ 
versely  affected  by  some  other  factor  that  was  unique  to  that 
aircraft  program;  i.e.,  some  management  policy,  some  engineer¬ 
ing  technique,  or  a  major  change  in  the  program  in  mid  stream, 
etc.  A  poor  estimate  of  the  problem  point  will  not  confirm 
the  existence  of  any  one  or  all  of  these  possibilities.  But 
a  good  estimate  of  the  problem  point  allows  consideration  of 
these  possibilities  to  be  minimized.  It  also  says  that  if 
the  good  estimating  relation  had  been  known  and  used  before 
production  of  the  problem  aircraft  had  begun,  it  would  have 
been  of  value.  One  would  also  have  some  faith  in  the  ability 
of  an  estimating  relation  developed  in  this  manner  to  accurately 
predict  the  labor  costs  of  a  new  aircraft  that  represents  an 
extension  of  technology  past  the  technology  of  the  problem 
point  aircraft. 
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The  tern  critical  point  will  be  used  in  this  study  to 
indicate  the  problem  point  chosen  to  test  the  ability  of  an 
estimating  relation  to  predict.  The  aircraft  representing 
the  critical  point  will  vary  depending  on  the  particular 
variables  being  used  to  develop  the  estimating  relation.  In 
general,  it  will  be  the  point  that  appears  most  discontinuous 
when  viey.-ed  in  relation  to  the  other  six.  All  estimating  re¬ 
lations  will  be  developed  with  the  critical  point  removed 
from  the  data  set.  Those  relations  that  show  the  ability  to 
estimate  the  critical  well  will  be  selected  for  refinement 
and  further  testing. 

Graphic  Analysis 

Graphic  analysis  is  the  least  sophisticated  yet  the 
most  valuable  method  used  because  of  its  ability  to  improve 
understanding  of  how  variables  relate.  Nearly  every  combina¬ 
tion  of  variables  possible  was  graphed  but  only  those  that 
have  some  special  significance  are  included  here.  Those  that 
show  some  important  relationship  are  included  along  with  some 
that  show  a  complete  absence  of  any  recognizable  relations. 
This  is  done  to  show  why  certain  paths  of  analysis  were  pur¬ 
sued  and  others  omitted. 

Several  conventions  are  adopted  in  the  graphs  to  be 
presented.  First,  as  mentioned  previously,  since  the  bulk  of 
the  graphs  to  be  presented  were  constructed  on  log  log  paper, 
unless  specifically  noted  otherwise,  all  graphs  can  be  con¬ 
sidered  log  log.  Scales  will  be  emitted  from  all  graphs  to 
protect  privileged  information.  When  referring  to  a  specific 
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point  on  any  graph,  the  points  on  that  graph  will  be  denoted 
by  the  letters  a  through  g.  The  furthest  point  to  the  left 
will  be  designated  as  point  a  and  the  remainder  be  assigned 
consecutively  to  the  right.  Since  there  are  usually  seven 
points  on  each  graph,  the  right  most  point  would  be  point  g. 

The  criteria  for  judging  the  worth  of  a  graph  is  the 
reasonableness  of  an  assumption  that  there  exists  a  continu¬ 
ous,  relatively  smooth  underlying  function.  Where  this  as¬ 
sumption  seems  plausible,  arid  this  underlying  function  helps 
to  explain  most  of  the  points  on  the  graph,  then  a  search  for 
that  function  will  be  pursued. 

Multiple  Regression  Analysis 

A  computer  program  w&s  designed  to  perform  multiple 


regression  analysis  on  equations  of  the  form: 

Y  =  Bq  +  B1X1  +  BgXg  +  .....  +  +  E  (6) 

and 

Y  =  eBo  .  X^B1  . X.Bj  .  eE  (7) 


The  regression  on  equation  (7)  is  performed  by  taking  the 
logarithm  of  both  sides  which  gives  the  linear  equation: 

Log(Y)  =  Bq  +  B1.Log(X1)  +  ...  +  B^-LogCXj)  +  E  (8) 

In  the  analysis  performed,  Y  is  usually  hours  or  hours  per 
pound  and  the  X^'s  are  various  combinations  of  performance 
data  and  time.  E  iE  the  error  term.  The  computer  program 
solves  for  the  values  of  the  B^'s,  the  coefficient  of  aeter- 
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mination  (H^),  the  partial  coefficient  of  determination  for 

2 

each  independent  variable  (R^  ),  an  estimate  of  the  variance 

jL 

of  £  and  the  P  statistic. 

Statistical  tests  will  not  be  emphasized  in  the  se¬ 
lection  of  usable  estimating  equations  for  two  reasons. 

First,  to  use  the  P  statistic,  it  requires  the  assumption  of 
normality  in  the  distribution  of  E  with  a  mean  of  zero.  There 
are  few  good  reasons  for  making  thie  assumption  except  to 
provide  a  statistical  test.  Second,  and  more  important,  is 
that  the  number  of  observation':  being  regressed  is  not  large 
enough  to  support  statistical  conclusions  on  more  than  one 
variable  (Ref  26:60,  61 ). 

The  prime  measures  of  validity  will  be  the  ability  to 
predict  as  explained  above,  and  the  coefficients  of  determi- 

p 

nation  (R  )  and  the  partj.a.1  coefficients  of  determination 
2  2 

(Ri  ).  R  may  be  viewed  as  that  percent  of  the  variation  of 
Y  around  its  mean  that  is  explained  by  the  B^'s  and  the  X^’s 

of  the  regression  equation.  A  strong  relationship  would  have 

2  2 
a  R  near  1.0  and  a  weak  relationship  R  would  be  near  zero. 

2 

The  partial  coefficients  of  determination  (R^  )  may  be  viewed 

as  the  percent  of  remaining  variation  that  is  explained  by 

the  addition  of  the  ith  variable  to  the  regression  equation 

after  all  other  variables  have  been  regressed.  High  values 

2 

and  a  tendency  toward  equality  of  all  the  R^  ’  s  would  be  de¬ 
sirable  (Ref  18:199,  177;  25:31). 

The  equations  used  in  the  computer  program  and  the 
linear  algebra  necessary  to  solve  them  would  add  cnly  length 
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to  this  analysis  and  will  therefore  be  omitted.  The  computer 
program,  however,  has  been  included  as  Appendix  B,  The  above 
references  provide  de .  tiled  explanations  of  the  calculations 
and  their  interpretation  as  does  Fisher  (;?ef  16:  Section  VI). 

Curve  Fitting 

Where  the  requirement  for  linearity  or  log  linearity 
cannot  be  met,  or  where  the  functional  relationship  between 
two  or  more  variables  is  to  be  exactly  specified,  then  curve 
fitting  by  least  squares  is  used.  For  example,  if  a  graph 
shows  the  relation  between  Y  and  X  not  to  be  a  straight  line, 
it  might  be  desirable  to  derive  an  estimating  equation  in  the 
form  of  a  quadratic  such  as: 

Y  =  BQ  +  BnX  +  B^2  (9) 

The  above  computer  program  can  be  used  to  solve  for  the  Bi's 

2 

and  the  coefficient  of  determination.  R  may  then  be  used  as 
a  measure  of  goodness  of  fit  (Ref  19? 40). 

In  another  situation  it  might  be  desirable  to  apply 
an  adjustment  factor,  Z,  to  one  of  the  variables  and  fit  a 
quadratic  in  the  logarithm  of  the  resulting  variables.  An 
equation  of  this  form  would  be: 

Log(Y-Z)  =  Bq  +  B1«Log(X)  +  B2{Log(X))2  (10) 

The  solution  for  the  value  of  the  B^'s  in  this  equation  and 
any  other  where  the  exact  relationship  can  be  specified  is  a 
simple  matter  for  the  least  squares  curve  fitting  program. 

Its  major  limitation  is  the  imagination  of  the  user. 
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V.  Analysis  of  the  Data 

The  analysis  of  the  labor  cost  data  will  proceed  in 
three  distinct  phases.  The  first  will  be  graphic.  Hour 
labor  data,  both  adjusted  and  unadjusted  will  be  plotted 
against  the  various  weight,  speed,  snd  time  variables  with 
the  hope  of  identifying  suitable  variable  combinations  for 
the  second  phase,  the  mathematical  analysis.  Regression  ana¬ 
lysis  will  seek  functional  relationships  where  there  appear 
to  be  linear  or  log  linear  relations  between  the  variables. 
Where  the  graphic  analysis  suggests  that  there  is  a  non-linear 
relation,  assumptions  will  be  made  as  to  the  likely  form  of 
the  relation  and  curves  will  then  be  fitted  to  the  data.  The 
final  phase  of  the  analysis  will  be  statistical.  The  best 
functions  from  the  mathematical  analysis  will  be  selected  and 
statistical  tests  will  be  applied  to  judge  how  strong  a  rela¬ 
tion  exists  between  the  variables  and  how  well  the  selected 
functions  are  likely  to  predict  labor  costs  for  future  air¬ 
craft. 

Graphic  Analysis 

When  labor  costs  of  two  or  more  different  aircraft 
are  compared,  the  comparison  is  usually  done  in  terms  of  di¬ 
rect  labor  hours  per  AMPR  pound  of  airframe  weight.  Figs.  7, 
8,  and  9  plot  this  quantity  against  Time,  Weight,  and  Maximum 
Airspeed  respectively.  Y  is  the  general  symbol  adopted  for 
hours  per  AMPR  pound.  Y^  is  the  first  unit  labor  costs  found 
from  the  learning  curve  after  prototype  aircraft  have  been 
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Fig.  7.  First  Unit  Labor  Costs 
(in  hours/AKPR  pound)  Vs  Time 
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Pig.  8.  First  Unit  Labor  Costs 
(in  hours/AMPR  pound)  Vs  Weight 
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A  Yfl_j  vs.  Maximum  Speed 
a  Y&  vs.  Maximum  Speed 
o  Yj  vs.  Maximum  Speed 


Pig.  9»  First  Unit  Labor  Costs 
(in  hcurs/AMPR  pound)  Vs  Speed 
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removed.  Y&  is  Y1  adjusted  for  pre-production  learning.  The 
graphs  show  Y1  and  Y  .  Also  plotted  on  these  first  three 
figures  is  Ya  .  where  it  is  significantly  different  from  Ya. 
Ya_.  is  fo'ond  by  extrapolating  from  the  first  lot  mid  point 
to  the  Y  axis  using  the  slope  of  Y&.  Because  Y  is  so  si¬ 
milar  to  Y  .  except  at  one  point,  it  will  not  be  shown  on 
£1 

further  graphs. 

The  most  important  thing  to  note  from  these  figures 
is  that  no  one  variable  alone  presents  a  reasonable  explana¬ 
tion  of  labor  costs.  If  a  relation  is  to  be  identified,  it 
will  require  two  or  more  variables.  It  might  be  hypothesized 
that  there  is  an  underlying  shallow  U  shaped  curve  in  all 
three  cases,  or  that  both  Y  vs  Time  and  Y  vs  Weight  could  be 
represented  by  a  slightly  downward  sloping  straight  line. 

But  neither  assumption  is  clear  or  strong.  These  are  possi¬ 
bilities  that  will  be  investigated  further  in  the  mathemati¬ 
cal  analysis. 

Since  Y  is  labor  hours  divided  by  AMPR  weight,  it 
might  be  instructive  to  see  the  picture  of  laoor  hours  alone 
plotted  against  the  same  variables.  To  do  this,  Y^  and  Y& 
were  multiplied  by  their  respective  AMPR  weights  and  desig¬ 
nated  H1  and  Hq.  plotted  against  Time,  Weight,  and  Speed 
are  shown  in  Pigs.  ^ 0 ,  11,  and  12  respectively.  In  the  case 
of  time  and  speed,  there  is  no  recognizab?.e  relationship,  but 
this  is  not  so  in  the  case  of  weight.  Clearly,  a  straight 
line  or  a  shallowly  upward  curving  line  could  be  underlying 
the  vs  Weight  curve  of  Fig.  11. 
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Fig.  11.  First  Unit  Labor  Costs 
(in  hours)  Vs  Weight 
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H 

(hours) 


Fig. 


o  vs.  Maximum  Speed 


Maximum  Speed 


12.  First  Unit  Labor  Costs 
(in  hours)  Vs  Speed 
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These  f.rst  six  figures  were  all  plotted  on  log  log 
graph  paper.  Essentially  the  same  picture  is  presented  on 
rectangular  coordinate  paper  and  they  are  not  shown  for  that 
reason.  These  six  figures  point  the  way  for  the  mathematical 
analysis  that  follows,  namely  a  search  for  linear  and  log 
linear  combinations  of  two  or  more  variables  that  explain  Y 
and  H,  and  a  search  for  non-linear  functions  that  explain  the 
curve  of  H  vs  Y'eight. 


Mathematic  Analysis 

Linear  Regression.  The  first  series  of  regressions 
seek  to  relate  Y  to  speed,  weight,  and  time  as  the  indepen¬ 
dent  variables.  These  independent  variables  will  be  taken 
tv/o  and  three  at  a  time  by  selecting  one  each  from  the  fol¬ 
lowing  categories: 

1.  V 


2. 

3* 


V  V  Va 

TaxP>  , 


The  equations  formed  will  be  of  the  following  fora: 


Y  =  Bq  +  D1*V  +  B2«W  +  ByTexp  +  E  (11) 

or 

Log(Y)  =  BQ  +  B1 »Log(V )  +  B2.Log(W)  +  B3.Log(Texp)  +  E(12) 
The  use  of  variables  W  and  W  produce  nearly  iden- 

CX  O  ^ 

tical  results  in  all  of  the  regressions  where  they  are  used. 
For  simplicity  sake,  W  &  will  be  omitted  from  further  consi¬ 
deration.  W  produced  consistently  slightly  better  results 

D 

than  W  so  where  there  ere  a  considerable  number  of 
a 
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combinations  to  be  checked,  only  a  few  using  W  will  be 

a 

checked  and  reported. 

Point  g  in  Pig.  8,  Y  vs  Weight,  and  Point  e  in  Pig.  9, 
Y  vs  Speed,  are  the  most  discontinuous  and  therefore  the  most 
difficult  to  estimate.  The  initial  regressions  were  per¬ 
formed  using  information  on  only  six  of  the  seven  aircraft 
for  which  information  is  available.  That  is,  these  two  cri¬ 
tical  points  were  excluded  when  either  weight  or  speed  was 
used  as  an  independent  variable.  The  regressions  were  then 
repeated  with  these  critical  points  included.  The  results  of 
these  two  series  were  then  compared  for  their  ability  to  ac¬ 
curately  predict  the  critical  points.  Additional  comparisons 
can  be  made  from  the  coefficients  of  determination. 

The  results  of  these  regressions  are  summarized  in 
Table  III.  The  table  shows  only  a  selected  few  of  the  vari¬ 
able  combinations  tested.  Those  selected  were  chosen  to  pre¬ 
sent  a  balanced  view  of  the  results  which,  as  can  be  seen, 

p 

were  quite  poor.  The  R  are  quite  low  and  the  partial  coef- 

2 

ficients  of  determination  (R^  )  are  low  and  unbalanced.  All 
of  the  regressions  using  six  observations  produce  large  devi¬ 
ations  in  the  resulting  estimates  of  the  critical  points  (de¬ 
noted  in  the  table  by  Y*-Y).  The  inclusion  of  the  seventh 
observation,  as  expected,  produced  better  estimates  of  the 
critical  points  but  these  better  estimates  were  still  quite 
poor.  If  normality  in  the  error  terms  (E)  of  the  regression 
equations  is  assumed  and  the  relation  between  the  variables 
tested  with  the  P  statistic  (Ref  24:123),  a  significant  rela- 
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TABLE  III 

SUMMARY  0?  REGRESSION  ANALYSIS  OP  Y  ON  V,  Wf  TeXJ) 


Dependent 

Y 

Y 

Y 

Y 

Y 

Y 

Variable 

1 1 

xa 

1 

1 1 

*a 

Independent 

V 

V 

V 

V 

V 

V 

Variables 

Wg 

wg 

% 

Wg 

V/g 

w« 

Texp 

Tsxp 

Texp 

Tsxp 

Equation 

Form 

Log 

Log 

Log 

Log 

Linear 

Linear 

Sample  Size 

7 

7 

6 

6 

6 

6 

R2 

.023 

00 

<•'0 

o 

• 

.669 

,891 

.703 

.831 

R12 

.00 

.04 

.44 

.49 

.59 

.63 

r22 

.02 

.00 

.11 

.54 

.03 

.00 

r32 

- 

- 

.55 

,75 

.65 

.73 

Y*-Y 

-4.47 

-4.0 

-9.66 

-9.52 

-9.11 

-11.40 

F  Ratio 

.05 

.08 

1.5 

5.4 

1.6 

3.3 

F  Statistic 

4.32 

4.32 

9.16 

9.16 

9.16 

9.16 

Note:  Symbols  defined  on  pages  ix,  x,  and  xi. 
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ticn  between -the  variables  is  rejected  in  every  case  with  a 
critical  region  of  size  .10. 

The  above  regressions  used  Tex^  in  two  forms.  The 
first  was  calculated  by  selecting  a3  a  base  year  one  year 
prior  to  the  start  of  production  for  the  oldest  aircraft  in 
the  sample.  This  base  year  (and  month)  was  then  subtracted 
from  the  start  production  dates  for  the  remaining  aircraft  to 
produce  a  time  scale.  The  second  form  of  Tex^  used  a  base 
year  five  years  prior  to  the  first  and  was  calculated  in  a 
similar  manner.  Both  of  these  variables  produce  essentially 
the  same  results  and  the  better  of  the  two  is  shown  in  Table 


III. 

The  next  series  of  regressions  was  designed  to  check 
if  time  used  in  another  form  produced  better  results.  The 
concept  of  straight  line  time  was  substituted  for  exponential 
time  and  the  series  repeated.  Straight  line  time  is  calcu¬ 
lated  and  used  in  the  following  way. 

A  suitable  learning  fraction  (r)  must  first  be  selected 
to  represent  the  amount  of  reduction  in  labor  requirements 
that  might  occur  if  a  task  were  to  be  performed  at  one  point 
in  time  rather  than  ten  years  earlier.  Then  the  time  adjust¬ 
ment  factor  for  the  start  production  date  of  aircraft  i  can 
be  found  by  the  equation: 


T 


rvi 


=  r 


Vdb 

TO 


(13) 


i  *  1 


-7 


where 


r  is  the  selected  learning  fraction  between 
0  and  1 . 0 
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d^  is  the  start  production  date  in  years  for 
the  ith  aircraft  (to  the  nearest  twelfth) 
d^  is  the  selected  base  year  (to  the  nearest 
twelfth)  and  remains  the  same  for  all 
seven  calculations 


Equation  (13)  raises  the  learning  fraction  to  a  power  equal 
to  one  tenth  the  difference  between  two  dates.  Tables  IV  and 
V  show  the  resulting  time  adjustment  factors  for  a  learning 
fraction  of  0,8  and  base  years  of  1945  and  1970  respectively. 


TABLE  IV 

TIME  ADJUSTMENT  FACTOBS 
r  =  .8  db  =  1945 


Start 

Date 

Production 

1945 

1950 

1955 

I960 

1965 

1970 

Time 

Factor 

1.00 

.895 

.800 

.715 

.640 

.572 

TABLE  V 

TIME  ADJUSTMENT 

FACTORS 

r 

=  .8 

db  =  1970 

Start 

Date 

Production 

1945 

1950 

1955 

I960 

1965 

1970 

Time 

Adjustment 

1.75 

1.57 

1.40 

1.25 

1.12 

1.00 

Factor 


Table  IV  can  be  interpreted  to  show  xhe  labor  require¬ 
ments  if  the  performance  of  a  1945  task  requiilng  one  hour  was 
delayed  to  the  later  years  shown.  Table  V  on  the  other  hand 
shows  the  labor  requirements  if  a  task  requiring  one  hour  in 
1970  was  performed  at  an  earlier  date. 


% 
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Both  of  these  tables  assume  that  there  is  a  twenty 
percent  reduction  in  labor  requirements  each  ten  yeais.  From 
Table  V  it  is  possible  to  make  estimates  of  present  labor  re¬ 
quirements  if  tasks  performed  in  earlier  years  were  actually 
performed  for  the  first  time  in  1970  and  2C$  saving  in  labor 
is  an  appropriate  figure.  By  dividing  the  actual  historical 
labor  expenditures  incurred  in  1945  by  1.75  the  estimated  1970 
labor  requirements  for  a  1945  task  can  be  found. 

Using  equation  (13)  and  dg  equal  to  the  start  produc¬ 
tion  date  for  the  last  aircraft  produced  in  the  study  sample, 
straight  line  time  adjustment  factors  were  calculated  for  all 
ai:  craft  in  the  sample  for  each  .05  increment  between  .65  and 
.90.  Y1  and  Yq  were  divided  by  these  time  adjustment  factors 
and  the  resulting  Y/Tr’s  were  regressed  on  the  same  weight 
and  speed  variables  used  above.  The  results  were  very  simi¬ 
lar  to  the  first  series  of  regressions.  Table  VI  shows  the 
results  that  were  obtained  for  some  variable  combinations  not 
shown  in  Table  III.  The  reason  for  showing  only  T  g  is  not 
because  T  g  produced  any  better  or  worse  results  than  other 
straight  line  time  fractions,  but  that  T  g  will  play  an  impor¬ 
tant  role  in  later  analysis  and  this  information  will  be 
available  for  comparison. 

The  graph  of  total  labor  hours  vs  weight  shown  in  Fig. 
11  provides  the  motivation  for  the  next  series  of  regressions. 
The  curve  of  H  vs  Weight  is  so  much  more  continuous  in  appear¬ 
ance  that  it  suggests  H  might  be  easier  to  estimate  than  Y. 

To  determine  if  this  is  so,  linear  and  log  linear  regressions 
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TABLE  VI 

SUMMARY  OF  REGRESSION  OF  Y/T  q  ON  V,  V 


Dependent 

Variable 

Vt.8 

VT.8 

Vt.8 

Vt.8 

Vt,8 

Vt.8 

Independent- 

Variable 

Aa 

*a 

'*6 

V 

V 

Equation 

Form 

Log 

Log 

Log 

Log 

Log 

Log 

Sample  Size 

7 

7 

7 

7 

7 

7 

a2 

.381 

.348 

.338 

.307 

.494 

.445 

v#_y 

-4.77 

-5.25 

-4.63 

-5.03 

-7.44 

-6.75 

F  Ratio 

3.07 

2.67 

2.56 

2.21 

4.89 

4.01 

F  Statistic 

4.06 

4.06 

4.-06 

4.06 

4.06 

4.06 

TABLE  VI 

Continued 

Dependent 

Variable 

Vt.8 

Vt.8 

VT.8 

VT.8 

VT.8 

V.8 

Independent 

Variable 

Wa 

*« 

Wa 

V 

V 

Equation 

Form 

Linear 

Linear 

Linear 

Linear 

Linear 

Linear 

Sample  Size 

6 

6 

6 

6 

7 

7 

H2 

.040 

.047 

.000 

.000 

.472 

.427 

Y*-Y 

-7.76 

-7.99  - 

•10.13 

-6.75 

-6.90 

-6.32 

F  Ratio 

.16 

.19 

,00 

• 

o 

o 

4.47 

3.73 

F  Statistic 

4.54 

4.54 

4.54 

4.54 

4.06 

4.06 

* 
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were  performed  using  the  following  variables.  The  dependent 
variables  were  chosen  from: 


H 


1» 


The  independent  variables  were  chosen  from: 

1.  V 


3.  Texp 


V.'here  H/T  was  chosen  as  a  dependent  variable.,  only  one  or  two 
of  the  variables  from  categories  1  and/or  2  rere  selected. 
Yrtiere  H,  or  H,  was  selected  aa  the  dependent  variable  as 

CL 

many  as  three  were  selected  with  no  more  than  one  Y»  variable 
in  any  one  equation.  The  same  two  time  scales  for  Texp  that 
were  used  \n  the  first  series  of  regressions  were  used  in 
this  series.  Each  set  was  regressed  both  with  and  without 
che  critical  points. 

The  results  of  this  series  of  regression  is  shown  in 
Te;ble  VII.  From  a  statistical  point  of  view  the  results  are 
very  much  better  than  the  previous  series.  However,  the 
ability  of  these  equations  to  estimate  the  critical  points  is 
again  very  poor.  As  in  the  previous  tables,  Table  VII  gives 
a  cross  section  of  the  results  and  all  of  the  combinations 
shown  in  the  table  were  regressed  without  the  critical  points 
included.  The  results  with  the  critical  points  included  dif¬ 
fered  very  little  from  the  results  shown  in  Table  VII, 
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Equations  derived  with  the  critical  included  were,  a3  expected, 
better  able  to  estimate  the  critical  points  but  the  improvement 
was  at  the  expense  of  poorer  estimates  of  other  sample  points. 
Overall,  as  estimating  equations,  they  have  high  variance  in 
their  estimates  of  the  sample  points  (i3.0  being  typical). 

One  convention  is  adopted  at  this  point.  When  equa¬ 
tions  are  developed  that  estimate  H  rather  than  Y,  the  point 
being  estimated  will  be  divided  by  its  AKPR  weight  and  devia¬ 
tions  expressed  in  terms  of  Lours  per  AKPR  pound.  In  this 
way,  all  deviations  can  be  kept  consistent  and  comparable. 

The  reason  for  the  poor  performance  of  these  estima¬ 
ting  equations  can  be  seen  by  examining  the  partial  coeffi¬ 
cients  of  determination.  The  partial  coefficients  for  weight 
variable  is  greater  than  .90  in  every  case  and  the  partials 
for  the  other  variables  are  low  and  erratic.  These  equations 
are  essentially  functions  of  weight  alona  with  just  minor  in¬ 
fluences  by  time  and  speed.  A  more  balanced  relation  is 
needed  for  a  good  estimating  relation. 

Before  continuing  the  search  for  a  balanced  relation, 
it  might  he  well  to  point  out  some  tentative  conclusions 
based  on  the  first  three  series  of  regressions.  Linear  and 
log  linear  combinations  of  weight,  speed,  and  time  apparently 
cannot  provide  low  variance  estimates  for  Y  and  H  by  them- 
selves.  Either  Y  and  H  are  not  estimable,  or  other  variables 
must  be  added  to  these  to  explain  Y  and  H,  or  an  altogether 
different  set  of  variables  must  be  used  if  linear  or  log 
linear  relations  are  to  be  found.  The  addition  of  more 
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variables  to  the  estimating  equations  is  not  desirable.  Add¬ 
ing  r.ore  variables  cannot  make  an  equation  produce  poorer  es¬ 
timates.  In  fact,  they  can  only  stay  as  good  or  improve.  By 
adding  enough  arbitrarily  selected  variables,  an  equation  can 
be  made  to  produce  near  perfect  estimates  where,  in  fact,  the 
relations  used  may  be  nonsense.  With  the  sample  size  used  in 
this  study,  statistical  inference  on  more  than  two  variables 
would  be  extremely  weak  (Ref  26:61).  Since  other  variables 
are  not  available,  the  search  for  estimating  relations  must  be 
ended  or  turned  to  non-linear  forms.  It  is  toward  non-linear 
forms  that  this  study  now  turns. 

Regression  of  Non-Linear  Variables.  Fig.  11  again 
provides  the  clues  for  the  analysis  that  follows.  In  examin¬ 
ing  the  individual  aircraft  in  Fig.  11  (First  Unit  Labor 
Costs  in  Hours  Vs  Maximum  Gross  Weight)  for  likely  reasons 
why  their  total  hours  deviated  from  some  imaginary  underlying 
curve,  it  became  apparent  that  the  deviations  were  related 
very  strongly  to  time.  Rough  calculations  showed  that  the 
older  the  start  production  date,  the  further  the  point  was 
from  an  imaginary  smooth  curve.  To  see  how  strong  the  influ¬ 
ence  of  time  was,  each  H  was  divided  by  T  and  plotted  on 
another  graph.  It  was  apparent  that  T  ^  was  too  large  an 
adjustment  and  T  g  was  selected  and  plotted.  The  results  of 
this  adjustment  are  shown  in  Fig.  13 >  H&  and  Ha/T.g  vs  AMPR 
weight.  Compared  to  previous  curves  K&/T  g  vs  AMFR  weight  is 
quite  smooth  and  continuous.  It  strongly  resembles  the  first 
quadrant  graph  of  a  parabola  with  its  axis  parallel  to  the 
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Fig,  13.  First  Unit  Labor  Costs 
(in  hours)  Adjusted  for  Time  Vs  AMPR  Weight 
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vertical  axis.  The  graphs  of  H./T  g  plotted  against  AMPR 
weight  and  both  H.j/T  g  and  Ka/T  g  plotted  against  gross 
weight  are  very  similar.  The  general  shape  of  these  curves 
is  retained  on  rectangular  coordinate  graph  paper.  The  pic¬ 
tures  presented  by  these  graphs  suggest  that  a  quadratic 
equation  in  H/T  and  W  and  a  quadratic  equation  in  the  logs  of 
these  valuables  might  produce  very  good  estimating  equations. 
These  equations  would  be  of  the  form: 

K/T^g  =  Bq  +  B^W  +  B2.(W)2  +  E  (14a) 

and 

Log(H/T.g)  =  Bq  +  B1 °Log(W )  +  B2*  Log(W)  2  +  E  ( 14b ) 

Combinations  of  H. ,  K  ,  W  ,  W  were  examined.  Before  showing 

1  a*  g’  a 

the  results,  it  might  be  well  to  perform  the  same  type  of 
time  adjustment  on  Y  vs  Weight  and  examine  the  results  for 
possible  relations. 

Fig.  14  shov/s  the  graph  of  Y/T  g  vs  AKPR  Weight.  The 
graph  of  Y  vs  Gross  Weight  is  omitted  for  simplicity  sake. 
This  graph  also  suggests  that  a  quadratic  equation  of  the 
above  form  with  H  replaced  by  Y  might  give  good  estimating 
equations . 

The  variable  combinations  just  mentioned  give  16  pos¬ 
sible  quadratic  estimating  equations.  Since  the  square  of 
v/eight  and  the  square  of  the  log  of  weight  can  be  calculated 
and  used  as  variables,  equations  (14a)  and  (14b)  are  still 
suitable  for  linear  regression,  it  is  possible  then  to  use  a 
least  squares  linear  regression  program  to  fit  the  quadratic 
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Pig.  14.  First  Unit  Labor  Costs 
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Vs  AMPR  Weight 
59 


GSA/SM/70-17 


curves  and  have,  at  the  same  time,  all  the  statistical  tools 
that  v/ere  available  for  multiple  linear  regression  (Ref  19s 
40,  107-123). 

The  regressions  of  these  quadratic  equations  were 
performed  in  the  same  manner  as  the  earlier  series  of  regres¬ 
sions,  that  is,  omitting  the  critical  points  and  using  a 
sample  size  of  six  and  then  repeating  the  regressions  with 
the  critical  points  included. 

The  results  of  these  regressions  are  very  good  from  a 
statistical  point  of  view  for  both  the  sample  size  of  six  and 
seven.  All  of  the  equations  derived  from  a  sample  size  of 
seven  produced  low  variance  estimates  of  the  critical  points. 
However,  that  was  not  the  case  with  the  sample  size  of  six. 

In  fact,  some  of  the  estimates  were  the  poorest  of  all  the 
equations  examined  so  far.  In  order  to  show  the  dangers  of 
indiscriminately  applying  equations  of  this  form,  the  results 
of  all  the  quadratic  regressions  with  the  critical  points  ex¬ 
cluded  will  be  shown  in  the  following  tables.  ?rom  the  best 
of  these,  equations  will  be  selected  for  statistical  analysis. 

The  equations  in  Table  VIII  produced  the  poorest  sta¬ 
tistical  results  and  had  the  greatest  deviation  in  the  esti¬ 
mates  for  the  critical  point3.  The  critical  point  estimates 
of  these  equations  v/ere  in  error  by  as  much  as  a  factor  of  5. 
In  Tables  IX,  X,  and  XI  the  variable  combination  H^/T  g,  or 
Yr/T  g  with  W&  produced  the  poorest  results.  The  remaining 
equations,  with  one  exception,  were  all  within  30$  of  the  ac¬ 
tual  value  of  the  critical  point.  These  eight  equations, 
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TABLE  VIII 

SUMMARY  OP  QUADRATIC  REGRESSION  Y/T^g  ON  W 
(In  Form  of  Equation  (14a)  Critical  Points  Excluded) 


Equation  Number 

15 

16 

17 

18 

Independent 

Variable 

V*.8 

V*.8 

V1. 8 

VT.8 

Dependent 

Variable 

vj 

a 

w 

g 

w 

a 

\ 

R2 

.878 

.891 

.885 

.879 

R12 

.86 

.87 

.88 

.87 

r22 

.87 

.88 

.88 

.88 

Y*-Y 

4121.5 

491.3 

498.9 

473.2 

P  Ratio 

10.8 

12.3 

11.6 

10.9 

P  Statistic 

5.46 

5.46 

5.46 

5.46 

TABLE 

IX 

SUMMARY  OP  QUADRATIC  REGRESSION  OP  K/T.g  ON  W 
(In  Pom  of  Equation  (14a)  Critical  Points  Excluded) 

Equc.tion  Number 

19 

20 

21 

22 

Independent 

Variable 

VT.8 

VT.8 

Vt.8 

V*.8 

Dependent 

Variable 

•  Wa 

"a 

R2 

.999 

.998 

.999 

.999 

R12 

.76 

.59 

.91 

.89 

R22 

.98 

.87 

.95 

.87 

Y*-Y 

+9.03 

41.00 

44.20 

41.28 

P  Ratio 

3788. 

717. 

3162. 

1493. 

P  Statistic 

5.46 

5.46 

5.46 

5.46 

61 


GSA/SM/70-17 


TA3LE  X 

SUMMARY  OF  LOG  QUADRATIC  REGRESSIO'  'T.8  ON  W 
(In  Form  jf  Equation  (14b)  Critical  Poi.  s  Excluded) 


Equation  Number 

23 

24 

25 

26 

Independent 

Variable 

VT.8 

V*.8 

Vt.8 

VT.8 

Dependent 

Variable 

w 

a 

Wg 

"a 

wg 

R2 

.975 

.980 

.961 

.963 

R12 

.97 

.98 

.96 

.98 

r22 

.97 

.98 

.96 

.98 

Y*~Y 

17.28 

8.98 

6.66 

2.33 

F  Ratio 

58.0 

73.4 

36.7 

39.3 

F  Statistic 

5.46 

5.46 

5.46 

5.46 

TABLE  XI 

SUMMARY  OF  LOG  QUADRATIC  REGRESSION  H/T.g  ON  W 
(In  Form  of  Equation  (14b)  Critical  Points  Excluded) 

Equation  Number 

27 

28 

29 

30 

Independent 

Variable 

H/T.8 

H/T.8 

Ha/T.8 

Ha/T.8 

Dependent 

Variable 

"a 

wg 

\ 

w 

g 

R2 

.999 

.993 

.999 

.997 

R12 

.94 

.80 

.84 

.78 

r22 

.97 

.90 

.96 

.92 

Y*-Y 

+17.27 

+3.89 

6.65 

-3.53 

F  Ratio 

1058. 

228. 

1071. 

480. 

F  Statistic 

5.46 

5.46 

5.46 

5.46 
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numbers  20,  21,  22,  25,  26,  28,  29,  and  30  are  selected  for 
further  analysis. 

Before  turning  to  the  statistical  tests,  it  is  worth 

pointing  out  some  relations  between  the  variables  that  can  be 

seen  in  the  four  tables.  Generally,  Ya/T  q  produces  better 

estimates  of  the  critical  point  than  does  Y^/T  g.  The  same 

is  true  of  H&/T  g  over  H^T  g.  H  in  general  is  better  than  Y 

and  W  is  better  than  W  .  These  trends  were  observed  earlier 
g  a 

in  the  linear  regressions,  but  the  linear  equations  were  so 
poor  that  not  much  attention  was  paid  to  them.  If  these  trends 
continues  with  tne  addition  of  ths  critical  points  to  the 
sample  size,  then  we  should  expect  that  equations  22  and  30 
would  produce  the  best  results. 

Statistical  Analysis 

The  statistical  analysis  will  be  concerned  with  two 
areas.  The  first  is  the  nature  of  the  distribution  of  error 
terms,  or  residuals  as  they  are  often  called,  of  the  equa¬ 
tions  that  are  selected  as  good  estimating  equations.  After 
determining  an  appropriate  model  for  the  error  terms,  the  se¬ 
lected  distribution  will  be  used  to  provide  a  80%  confidence 
interval  for  expected  value  of  the  labor  costs  of  a  hypothe¬ 
tical  500,000  pound  maximum  gross  weight  aircraft  produced  in 
mid  1970. 

The  criteria  used  to  select  the  good  estimating  equa¬ 
tions  must  insure  that  they  are  statistically  sound  and  capable 
of  providing  realistic  estimates.  To  insure  this,  the  equa¬ 
tion  regressed  without  critical  points  must  possess  a 
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coefficient  of  determination  greater  than  .80  and  be  able  to 
estimate  the  critical  points  within  plus  or  minus  30#  before 
it  will  be  considered  for  the  statistical  analysis.  All 
equations  regressed  were  examined  using  this  criteria  and 
only  the  eight  previously  mentionea  ,jadratic  equations 
qualified. 

Distribution  of  the  Error  Term.  Tab] e  XII  shows  the 
deviations  between  the  equation  estimate  for  the  hours  per 
AMPR  pound  for  each  of  the  seven  aircraft  and  the  actual 
value  of  Y1  or  Yfl  that  the  equation  is  based  on.  Where  the 
equation  is  in  log  form  or  the  dependent  variable  is  in  hours 
rather  than  hours  per  AMPR  pound,  these  values  were  converted 
to  hours  per  AMPR  pound  for  calculating  the  deviations. 

These  values  are  shown  to  provide  some  feel  for  the  accuracy 
of  the  selected  equations  and  are  not  the  deviations  to  be 
analyzed  for  the  distribution  of  the  error  terms.  Notice 
that  equations  20-22  are  rather  poor  in  their  ability  to  es¬ 
timate  the  first  five  aircraft  when  compared  to  the  remaining 
equations,  but  they  are  very  good  on  aircraft  6  and  7.  The 
log  quadratic  equations  on  the  other  hand  seem  to  be  equally 
good  throughout  all  aircraft. 

Returning  to  the  question  of  the  distribution  of  the 
error  terms,  regression  theory  provides  many  ways  to  perform 
calculations  of  the  range  of  possible  errors  of  a  prediction 
provided  the  assumption  can  be  made  that  the  error  terms  are 
distributed  normally.  To  make  use  of  these  methods,  it  is 
necessary  first  to  insure  that  the  normal  distribution  is  a 
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reasonable  model  for  the  distribution  of  the  error-  terms  in 
the  selected  equations. 

To  do  this,  the  Kolmogorov-Smirnov  Test  for  goodness 
of  fit  is  used.  This  test  examines  the  deviations  between 
actual  and  estimated  values  in  the  domain  and  the  units  that 
the  equation  is  written.  For  example,  equation  (30)  is  a 
quadratic  in  Log(Ha/T  g)  and  Log(Wg).  The  deviation  of  in¬ 
terest  in  this  relation  is  the  deviation  between  the  esti¬ 
mated  value  for  Log(H&/T  g)  and  the  actual  observed  value  for 
Log(Ha/T  g).  This  deviation  for  all  seven  aircraft  must  be 
calculated  and  compared  to  the  Kolmogorov-Smirnov  Statistic 
and  some  conclusion  is  made  concerning  the  reasonableness  of 
a  norrral  assumption.  The  actual  calculations  for  this  test 
are  developed  in  Appendix  A.  It  is  sufficient  here  to  say 
that  it  is  impossible  to  reject  the  normal  assumption  using 
the  Kolmogorov-Smirnov  Test  even  with  the  largest  critical 
region  available.  The  conclusion  is  that  the  error  term 
might  not  be  distributed  normally,  but  the  normal  distribu¬ 
tion  is  a  very  good  approximation  of  the  distribution  of  the 
error  terms  for  a31  eight  equations. 

Prediction.  If  the  error  terms  are  distributed  nor¬ 
mally,  then  the  t-statistic  can  be  used  along  with  the  dis¬ 
tribution  of  the  estimate  to  provide  a  confidence  interval 
for  the  prediction.  Generalizing  equations  (14a)  and  (14b) 
gives  an  equation  of  the  form: 

Z  =  BQ  +  B.X1  +  B2X2  +  E  (31) 


♦ 
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where  Z  is  the  Y  or  H  dependent  variable  or  its  log,  and  X1 
is  the  independent  ff  variable  or  its  log,  and  X2  is  the 
8 qua re  of  X^. 

Knowing  the  weight  of  a  proposed  aircraft  (say  X*)  and 
when  it  will  be  built,  and  assuming  that  S  is  distributed 
normally  we  can  say  that  the  estimated  labor  costs  for  this 
aircraft  (Z*)  is  also  distributed  normally  with: 

Expected  Value  of  Z*  =  E(Z*)  =  BQ  +  B^*  +  B^*  (32) 

Variance  of  Z*  =  S2  *=  x*»V«x  (33) 

Equation  (33)  is  expressed  in  matrix  notation  and  x  and  V  are 

defined  as  follows:  x  is  a  column  vector  (in  this  case  2x1) 

of  the  deviations  of  X.*  from  the  mean  value  (?)  of  the  X-  of 

• 

the  original  sample  (size  j)  that  was  used  in  forming  the  re¬ 
gression  equation;  i.e. 

i 

?.  =  4- 

xi  =  Xi*  ’  X±  (35) 

V  in  equation  (33)  is  the  variance-covariance  matrix  for  the 
regression  coefficients  (in  this  case,  and  B2)  for  the 
equation  that  is  providing  the  estimate  (Ref  24:132-134). 

Knowing  the  distribution  of  Z*  allows  us  to  calculate 
a  confidence  interval  of  size  100(1-2a)  i*  with  the  use  of  the 
t~statistic.  The  equation  for  this  calculation  is: 

E(Z*)  ~  t.  •  S  (36) 

a,  j 
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Knowing  the  size  of  the  interval  desired  allows  a  to  he  de¬ 
termined.  The  second  parameter  of  the  t-etatistic  is  the 
sample  size  from  which  the  regression  equation  was  derived. 
For  the  equations  in  this  study,  and  a  confidence  interval  of 
8O7*,  a  =  .10  and  j  =  7.  The  appropriate  t-statistic  is 

*a,J  =  1’415  t37) 

Assuming  that  a  confidence  interval  is  desired  for  a 
proposed  aircraft  that  weighs  500,000  pounds  maximum  gross 
weight  for  take-off,  or  188,350  pounds  AMPR,  and  that  the 
aircraft  will  he  huilt  in  the  middle  of  1S70,  all  the  neces¬ 
sary  information  is  available  to  construct  the  confidence  in¬ 
tervals  for  the  expected  labor  costs  in  hours  per  AMPR  pound. 

p 

Solving  for  the  values  of  E(Z*)  and  S  by  equations  (32)  and 
(33)  for  each  estimating  equation  and  using  the  relation  (36) 
gives  the  confidence  intervals  shown  in  Table  XIII  and  Fig. 
15. 

It  should  be  noted  that  there  is  no  method  of  calcu¬ 
lating  confidence  intervals  for  the  lognormal  distribution; 
i.e.,  the  inverse  logarithmic  transformation  of  a  normal  dis¬ 
tribution  (Ref  1:50,  85).  This  requires  that  the  confidence 
intervals  for  the  log  quadratic  equations  be  performed  in  the 
log  domain.  Since  the  inverse  logarithmic  transformation  is 
a  monotonic  function,  the  inverse  transformation  of  a  valid 
probability  statement  in  the  log  domain  holds  with  the  same 
probability;  i.e.,  if: 

a  =  P(b<Z<c)  b,Z,e>0  (38) 


♦ 
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TABLE  XIII 

80 $  CONFIDENCE  INTERVAL  FOR  E(2»)  IN  HOURS/AMER  POUND 


Equation 

Number 

Lower 

Limit 

E(Z*) 

Upper 

Limit 

Width 

20 

19.10 

19.31 

19.51 

.41 

21 

20.34 

20.52 

20.70 

.36 

22 

19.17 

19.35 

19.53 

.36 

25 

19.56 

20.22 

20.89 

1.33 

26 

19.64 

20.09 

20.54 

.90 

28 

17.96 

18.89 

19.87 

1.91 

29 

19.56 

20.21 

20.89 

1.33 

30 

18.61 

19.23 

19.87 

1.26 
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Hours  per  AMPR  Pound 


Pig.  15. 

80$  Confidence  Intervals  for  E(Z$ 
(Rectangular  Coordinates) 
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is  true,  then 

a  *  P(eb<eZ<ec)  (39) 


is  also  true.  Showing  that  the  error  terms  distribution  can 
be  assumed  normal  in  the  log  domain  allows  confidence  inter¬ 
vals  to  be  calculated  in  the  log  domain  and  then  transformed 
to  the  corresponding  values  of  H  and  Y. 

It  should  also  be  noted  that  the  width  of  the  confi¬ 
dence  interval  for  each  equation  is  not  a  constant.  The  in- 


varoa  transformations  of  the  log  confidence  intervals  ere  not 
symmetric  around  E(Z*).  The  width  of  all  the  confidence  in¬ 
tervals  will  vary  with  changes  in  the  weight  of  t^e  aircraft 
being  estimated.  An  example  might  illustrate.  If  a  100,000 
pound  gross  weight  aircraft  with  an  AMPR  weight  of  37,670 
poruids  is  to  be  estimated  for  production  in  mid  1970,  equa¬ 
tions  (22)  and  (29)  give  the  following  results: 


TABLE  XIV 


80$  CONFIDENCE  INTERVAL  FOR  A  100,000  POUND  AIRCRAFT 


Lower  Limit 

E(Z*) 

Upper  Limit 

Width 

Equation  (22) 

13.61 

13.80 

13.98 

.37 

Equation  (29) 

14.16 

14.41 

14.66 

.50 

In  this  example,  the  width  of  the  interval  for  equations  (22) 
increased  slightly  over  width  in  the  previous  example.  The 
width  of  the  interval  for  equation  (29),  however,  is  only 
37.6$  of  its  previous  value. 
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Summary  of  the  Results 

The  variables  of  the  eight  estimating  equations,  the 
equation  form  and  the  b  coefficients  are  shown  in  Table  XV. 

The  first  three;  i.e.,  equations  (20)  through  (22)  are  of  Q 
form,  that  is 

ip  =  Bq  +  +  B2(W)2  (40) 

.8 

The  weight  variable  is  identified  as  gross  weight  or  AMPR 
weight  in  the  table.  H  now  loses  its  identifying  subscript 
since  it  is  the  estimate  of  the  hours  required  to  produce  an 
aircraft  that  is  not  to  be  preceded  by  prototype  production. 
Estimating  the  hours  required  if  a  prototype  is  produced  is 
not  considered. 

The  remaining  five  equations  are  of  L  form.  That  is 

l»og(Z/T  g)  =  Bq  +  B^Log(W)  +  B2(Log  (W))2  (41 ) 

Whether  the  dummy  variable  Z  is  Y  or  H  is  identified  in  the 
table  as  is  the  weight  variable. 

With  these  equations,  it  should  be  possible  to  esti¬ 
mate  labor  costs  early  in  the  development  phase  of  an  air¬ 
craft.  Knowing  maximum  gross  weight  alone  will  provide  four 
estimates  of  the  total  labor  costs  in  hours  and  one  estimate 
of  the  labor  costs  in  hours  per  AMPR  pound.  Further  down  the 
program  life,  when  engineering  details  are  more  clear  and 
AMPR  weight  is  known,  earlier  estimates  can  be  checked  with 
the  three  remaining  equations. 

The  major  unanswered  question  concerning  the  relations 
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derived  in  this  analysis  is  do  they  really  exist  in  the  forms 
shown,  or  has  a  series  of  adjustments  transformed  a  set  of  un¬ 
related  variables  into  an  apparently  related  set  that  has  no 
real  meaning?  The  next  chapter  looks  at  this  question.  Un¬ 
fortunately,  it  is  neither  possible  to  prove  or  contradict 
the  relations  presented  here  within  the  time  and  resources 
constraints  that  must  be  honored. 
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VI .  Remaining  Questions 

Two  ma;jor  questions  need  answering  before  the  rela¬ 
tions  presented  in  this  study  can  be  applied  generally.  The 
first  concerns  aircraft  designed  to  perform  different  mis¬ 
sions  or  aircraft  built  by  different  manufacturers.  Do  the 
first  unit  labor  costs  for  other  aircraft  behave  in  a  similar 
or  dissimilar  manner  when  corpared  to  the  first  unit  costs 
studied  here?  The  second  major  question  concerns  the  time 
variable  T  g  and  the  important  role  it  plays.  The  selection 
of  T  g  was  arbitrary.  The  choice  of  T  g  was  guided  by  its 
ability  to  fit  the  data  better  than  any  other  time  variable 
and  give  the  smoothest,  most  continuous  relations.  Is  T  g  a 
reasonable  choice?  Pull  answers  to  these  questions  will  not 
be  attempted.  Instead,  a  brief  inquiry  will  be  made  to  see 
if  the  results  of  this  study  are  reasonable  and  consistent 
with  some  readily  available  data. 

Time 

Since  the  tightness  ol  the  fit  between  the  estimating 
equations  and  the  data  was  made  possible  by  the  use  of  T  g, 
it  will  be  considered  first.  Denison  (Ref  14:158-160)  ana¬ 
lyzes  in  considerable  detail  the  changes  that  have  occurred 
in  productivity  of  the  national  labor  force  during  this  cen¬ 
tury.  The  data  he  presents  is  based  on  Commerce  Department 
Data  concerning  the  GNP.  The  commerce  department  data  is  con¬ 
verted  into  an  index  of  productivity  and  displayed  graphi¬ 
cally.  This  index  allows  the  compariscn  of  output  per 
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man-hour  in  different  time  periods.  His  figures  extended  to 
1958  which  means  that  they  cover  a  good  deal  of  the  period 
studied  in  this  analysis.  The  index  of  productivity  in  1958- 
was  190.  The  index  of  productivity  in  1945  was  either  137, 
using  the  actual  curve,  or  145  using  the  regression  best  fit 
curve.  The  interpretation  of  this  productivity  index  is  es¬ 
sentially  the  same  time  factor  discussed  earlier.  It  is  pos¬ 
sible  to  build  190  of  something  in  1958  for  the  same  amount 
of  labor  required  to  build  between  137  and  145  of  the  same 
thing  in  1945. 

Normalizing  these  figures  with  respect  to  1958  gives 
1.0  for  1958  and  between  .723  and  .763  for  1945.  Over  this 
13  year  period,  when  this  is  converted  to  a  straight  line 
time  learning  fraction,  we  find  that  the  nation's  work  force 
followed  a  straight  line  time  fraction  of  between  .777  and 
.811.  In  light  of  this,  the  selection  of  T  g  seem  very  good. 
However,  because  the  nation's  work  force  can  be  approximated 
by  T  g  there  is  no  guarantee  that  the  airframe  manufacturing 
industry  progressed  at  the  same  rate.  It  is  also  plausible 
that  the  labor  savings  that  might  have  been  realized  in  the 
airframe  industry  could  have  been  absorbed  by  the  increasing 
technological  complexity  of  their  product. 

It  also  seems  possible  that  learning  fractions  could 
vary  from  one  type  of  aircraft  to  another  depending  on  the 
degree  of  complexity  of  the  aircraft  concerned.  This  same 
variance  might  also  occur  between  different  manufacturers  de¬ 
pending  upon  how  resource  ussge  is  accounted  for,  management 
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competence,  and  production  techniques  unique  to  each  manufac¬ 
turer.  Determining  an  appropriate  learning  fraction  for  a 
particular  set  of  aircraft  could  be  a  major  undertaking.  Un¬ 
til  it  is  done,  it  seems  reasonable  to  use  I  g  or  any  other 
reasonably  close  fraction  in  estimating  equations. 

Other  Airframes  and  Manufacturers 

Since  detailed  information  was  not  available  from 
another  manufacturer,  it  is  impossible  to  perform  the  same 
type  of  analysis  on  another  data  set  to  confirm  or  contradict 
the  relationships  reported.  As  a  substitute,  some  data  was 
gathered  from  a  contract  proposal  on  file  in  the  Aeronautical 
Systems  Division  Cost  Data  Library,  Wright-Patterson  Air  Force 
Base,  Ohio.  Again  the  problem  of  privileged  information  was 
encountered.  The  actual  data,  the  aircraft  they  represent, 
or  the  manufacturer  cannot  be  identified. 

The  contract  proposal  in  question  contained  a  labor 
cost  estimate  for  a  proposed  new  aircraft.  This  estimate  was 
based  on  a  linear  regression  of  the  first  unit  labor  costs 
fcr  four  previously  built  aircraft.  The  time  of  production 
and  weight  of  the  aircraft  in  this  estimate  are  similar  to 
the  aircraft  examined  in  this  study.  However,  the  contract 
proposal  aircraft  were  designed  for  two  different  missions, 
neither  of  which  was  the  mission  of  the  study  sample  aircraft. 
The  contract  proposal  did  not  indicate  what,  if  any,  adjust¬ 
ments  were  made  to  the  first  unit  labor  costs  prior  to  inclu¬ 
ding  them  in  the  estimate. 

Figs.  16,  17,  and  18  show  a  graphic  summary  of  the 
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Pig.  16.  Contract  Proposal  Aircraft 
Pirst  Unit  Labor  Costs  (hours/AMPR  pound) 
Unadjusted  Vs  AMPR  Weight 
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Pig.  17.  Contract  Proposal  Aircraft 
First  Unit  Labor  Costs  (hours/AMPR  pound) 
Adjusted  for  Time  Vs  AMPR  Weight 
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four  labor  cost  points.  Fig.  16  plots  the  labor  hours  per 
AMPR  pound  as  reported  in  the  contract  proposal.  Fig.  17  ad¬ 
justs  the  contract  proposal  points  by  dividing  them  by  T  g. 
The  T  g  used  was  for  the  production  dates  of  the  contract 
proposal  aircraft  relative  to  the  same  date  used  in  this 
study.  Fig.  18  shows  the  total  labor  hours  after  dividing 
them  by  T  g. 

The  shapes  and  slopes  of  Figs.  17  and  18  are  similar 
to  those  previously  shown  in  Figs.  13  and  14,  which  plot  the 
same  quantities  for  the  study  aircraft  sample.  The  differ¬ 
ences  could  be  caused  by  different  accounting  and  reporting 
systems,  the  differing  missions,  different  relations  between 
prototype  and  production  aircraft  for  the  contract  proposal 
aircraft,  and  adjustments  that  might  have  been  made  for  the 
purpose  of  making  the  contract  proposal  cost  estimate.  The 
differences  could  also  be  caused  by  a  lack  of  an  underlying 
relation  similar  to  the  ones  in  this  study.  It  is  not  hard 
to  see  linear  relations  in  Figs.  17  and  18  instead  of  shallow 
curves . 

But  this  is  all  speculation.  The  necessary  informa¬ 
tion  is  not  available  to  disentangle  these  questions  and  com¬ 
pare  the  two  sets  of  aircraft.  There  is  no  clear  contradic¬ 
tion  between  the  two  aircraft  sets,  and  there  is  enough  simi¬ 
larity  to  suggest  that  non-linear,  or  non-log  linear,  rela¬ 
tions  might  exist  for  aircraft  of  other  manufacturers.  If 
detailed  production  data  were  available,  it  would  not  be  dif¬ 
ficult  to  determine  if  similar  relations  do  exist. 
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Other  Questions 

Several  other  areas  were  examined  during  the  course 
of  this  analysis.  The  results  were  generally  negative. 

Rather  than  omit  them  from  this  report,  they  will  he  covered 
briefly  to  present  a  balanced  view  of  the  entire  analysis. 

Research  and  Development  Effort.  At  the  outset  of 
this  project,  it  was  felt  that  variations  in  research  and  de¬ 
velopment  effort  would  partially  explain  consistent  deviations 
(Y*-Y)  in  estimated  values  for  aircraft  labor  costs  and  its 
actual  costs.  If  one  looks  at  all  the  equations  tested,  both 
linear  and  non-linear,  there  is  no  clear  pattern  in  the  devia¬ 
tions.  Those  that  are  the  most  troublesome  in  linear  ecua- 
ticns  are  not  troublesome  in  non-linear  equations  and  vice 
versa.  If  the  linear  and  non-linear  -  equations  are  examined 
separately  there  is  a  pattern  to  the  deviations.  These  pat¬ 
terns  were  compared  to  the  B&D  variables  gathered  from  the 
manufacturer. 

The  R&D  variables  were  analyzed  graphically  in  much 
the  same  manner  as  the  labor  costs  were.  These  R&D  variables 
were  expressed  in  their  absolute  values,  as  percents,  as  ra¬ 
tios  formed  by  dividing  by  W  ,  VV  ,  in-plant  AMFR  weight,  and 
time.  They  were  adjusted  for  varying  percentages  of  sub¬ 
contracting  that  occurred  between  the  programs.  All  of  these 

variables  were  plotted  against  W_,  W_,  Time.  The  curves  pro- 

a  g 

duced  were  generally  smooth  and  consistent.  No  relation  could 
be  found  in  these  curves,  or  any  points  they  contained,  and  the 
points  in  the  sample  set  were  difficult  to  estimate  accurately. 
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Since  the  information  contained  in  these  graphs  is  privileged 
and  contributes  nothing  to  the  study,  they  will  be  omitted. 

All  the  variables  were  non-linear  and  non-log  linear  increas¬ 
ing  functions  of  weight.  This  was  viewed  as  support  for  the 
rejection  of  simple  linearity  in  labor  costs,  but  not  neces¬ 
sarily  as  support  for  the  functional,  forms  chosen  for  the  es¬ 
timating  equations. 

Learning  Curve  Slope  and  Later  Units.  The  same  tech- 

x 

niques  of  graphic  and  regression  analysis  that  were  used  for 
Y&  and  Y^  were  applied  to  the  labor  costs  of  Y^q.  No  rela¬ 
tions  were  found.  The  most  probable  cause  for  this  is  that 
production  rate  was  not  introduced  as  &  variable.  Since 
there  are  several  studies  that  provide  methods  to  estimate 
Y1(jq  it  was  decided  that  further  effort  in  this  direction 
would  only  duplicate  previous  work.  A  good  summary  of  the 
various  methods  is  presented  in  a  Rand  paper  by  Barro  (Ref  6: 
3-13). 

With  one  exception,  there  seemed  to  be  no  relation  be¬ 
tween  t.ae  slope  of  the  learning  curve  and  the  variables  used 
in  this  study.  The  one  exception  concerns  the  slope  of  the 
more  recent  aircraft  learning  curves.  They  vary  within  plus 
or  minus  2$  of  their  average  value.  When  the  adjustment  fac¬ 
tor  is  applied  to  Y^  and  the  slopes  of  Y&  are  examined,  this 
variance  is  cut  in  half.  Apparently  a  rather  good  estimate 
of  the  learning  curve  can  be  made  by  using  a  simple  average. 

An  additional  adjustment  for  production  rate  might  also  be 
made . 
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Subcontracting.  The  amount  of  subcontracting  varied 
considerably  from  one  aircraft  to  the  next.  Subcontracting 
was  considered  alone  and  in  combination  with  some  of  the  R&B 
variables.  No  relation  could  be  found  between  the  amount  of 
subcontracting  and  first  unit  labor  costs. 

AMPR  Y< eight.  The  relatively  poor  performance  of  AHPR 
weight  as  a  variable  in  the  estimating  equations  as  compared 
to  gross  weight  was  puzzling.  If  gross  weight  is  considered 
a  good  estimating  variable  and  JuAFR  weight  considered  in  re¬ 
lation  to  it,  then  the  reason  for  the  poor  performance  of 
AMPR  weight  may  be  seen.  The  ratio  of  ANPR  weight  to  gross 
weight  was  formed  for  all  aircraft  in  the  sample.  These  were 
summed  and  an  average  ratio  was  found.  The  deviations  of  the 
individual  ratios  from  the  average  had  a  range  of  18$  of  the 
average  value. 

If  the  ratios  for  all  aircraft  were  the  same,  then  the 
exchange  of  gross  weight  for  AKPR  weight  would  have  no  effect 
on  the  estimating  equations.  The  fact  that  the  ratios  vary, 
and  their  introduction  into  the  equation  produces  poorer  re¬ 
sults  shows  that  their  variance  is  not  strongly  correlated  to 
labor  costs.  This  suggests  that  some  modification  of  AMPR 
weight  might  be  appropriate  when  it  is  desirable  to  compare 
labor  costs  of  different  aircraft. 
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VII .  Conclusion 

The  goal  of  this  project  was  to  devise  a  method  to 
provide  small  variance  estimates  of  first  unit  labor  cost.*;. 

This  has  been  done,  though  not  in  the  manner  anticipated. 

This  project  has  raised  more  questions  than  it  has  answered. 

The  purpose  here  is  to  discuss  the  questions  for  which  an¬ 
swers  should  be  attempted  before  the  non-linear  estimating 
equations  can  be  used  confidently. 

Pre-production  Learning 

The  first  adjustment  made  to  the  labor  cost  data  was 
to  remove  prototype  aircraft  from  consideration  and  add  a 
pre-production  learning  factor  to  the  lot  mid  points  of  those 
programs  that  contained  prototypes.  The  basis  for  selecting 
the  pre-production  learning  factor  was  largely  intuitive. 
Opinions  were  formed  through  talks  with  company  officials, 
examination  of  the  labor  data,  reading  historical  accounts  of 
the  development  of  the  aircraft  provided  by  the  company  and 
reading  historical  accounts  fo  uid  in  aircraft  fact  books  com¬ 
mon  to  most  libraries  such  as  Jane’s  All  the  Worlds  Aircraft, 
published  yearly  by  McGraw-Hill  of  New  York. 

When  these  adjustments  were  included  in  the  estimating 
relations,  they  produced  consistently  better  results  than  when 
they  were  excluded.  This  does  not  prove  their  validity,  but 
it  suggests  that  they  were  adjustments  made  in  the  right  direc¬ 
tion.  These  adjustments  were  made  with  little  input  from  ex¬ 
perts  in  the  field  of  engineering  or  production.  It  would  be 
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desirable  to  quantify  this  entire  adjustment  process  and  te.-;t 
several  alternative  forms  for  reasonableness. 

The  reason  this  is  so  important  is  that  first  unit 
labor  costs  are  very  sensitive  to  pre-produition  learning  ad¬ 
justment.  A  change  of  only  .25  to  all  .lot  mid  points  alters 
first  unit  labor  costs  between  1.0  and  1.5  hours  per  pound. 
Notice,  from  Table  XII,  that  changes  of  less  than  ,25  to  the 
learning  adjustment  factors  used  in  this  analysis  could  make 
the  estimating  equations  virtually  perfect  or  double  their 
variance . 

AKPR  Weight- 

As  was  mentioned  previously,  whenever  AMTR  weight  is 
replaced  as  an  independent  variable  in  either  the  linear  or 
non-linear  equations,  an  improvement  in  fit  and  estimating 
ability  usually  results.  This  is  particularly  noticeable  in 
equations  (15)  through  (30)  shown  in  Tables  VIII  through  XI. 
It  is  also  possible  to  remove  AMPR  weight  from  the  dependent 
variable.  Recall  that  Y  is  in  terms  of  hours  per  AKPH  pound 
and  H  is  in  hours  alone.  Whenever  a  change  in  dependent 
variable  is  made  from  Y  to  H  the  same  improvement  is  noted. 
Tables  VIII  and  IX  are  a  good  example.  In  each  case  when  Y 
(in  Table  VIII)  is  replaced  by  K  (in  Table  IX)  a  great  im¬ 
provement  is  noted  in  fit  and  estimating  ability.  The  same 
improvement  is  noted  within  each  table,  though  not  so  pro¬ 
nounced,  when  W  is  replaced  by  W  . 

a  o 

This  leads  one  to  question  the  wisdom  of  dividing 
total  labor  hours  by  AMPK  weight  when  it  is  intended  to 
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compare  first  unit  labor  costs  of  two  or  more  different  air¬ 
craft.  Some  weight  factor  should  be  vsed  to  make  comparisons 
possible.  AMPR  weight  apparently  is  not  an  optimal  choice. 
This  is  another  case  where  expert  opinion  in  the  field  of  en¬ 
gineering  and  production  might  be  profitably  used.  The  end 
resulx  should  be  to  identify  another  weight  variable  that  has 
the  desired  properties  of  AMPR  weight  (the  ability  to  relate 
design  and  weight  changes  to  labor  costs  in  the  learning 
curve  for  a  specific  aircraft)  and  at  the  same  time  make  the 
labor  costs  of  two  different  aircraft  models  comparable. 

Linearity  vs  Non-linearity 

There  is  no  reason  why  non-linear  relations  should  not 
be  expected  to  provide  the  best  cost  estimating  relations. 

It  has  been  convention  to  use  linear  (and  log  linear  is  in¬ 
cluded  in  this  term)  -elations  and  convention  is  hard  to  over¬ 
come.  Linear  relations  do  have  advantages  in  simplicity  and 
the  ability  to  not  make  gross  errors  if  they  are  improperly 
formulated  (see  Table  VII  for  gross  errors  with  poor  non¬ 
linear  relations).  But  if  thorough  investigation  supports 
the  existence  of  non-linearity,  then  it  is  much  more  accurate 
to  use  non-linear  functions  rather  than  approximate  with 
linear  ones. 

Thorough  investigation  in  this  case  would  mean  the  in¬ 
vestigation  of  other  aircraft  types  and  manufacturers.  The 
problems  encountered  in  this  study  with  privileged  informa¬ 
tion  would  make  it  almost  impossible  for  an  individual  to 
gain  access  to  a  cross  section  of  information  necessary  to 
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properly  investigate  this  que3tion.  A  simple  solution  of 
this  problem,  however,  is  available  to  the  government.  The 
government  could  require  manufacturers  submitting  contract 
proposals  to  include  raw,  unaltered  labor  cost  data  to  sup- 
port  the  cost  estimates  included  in  their  contract  proposals. 

As  data  accumulated,  it  would  be  a  simple  matter  to  analyze 
it.  Without  this  or  some  similar  minimum  step,  it  would  be 
unwise  to  apply  the  equations  in  this  report  to  other  manu¬ 
facturers  or  to  other  types  of  aircraft  built  by  the  manufac¬ 
turer  who  supplied  the  data  for  this  analysis. 

Use  of  Non-linear  Estimating  Equations 

The  equations  summarized  in  Table  XV  provide  estimates 
for  either  Y  or  H  relative  to  the  base  year  used  in  this 
study.  To  convert  these  values  to  later  time  periods,  it  is 
necessary  to  multiply  the  values  produced  by  the  equations  by 
an  appropriate  value  of  T  g  for  the  later  time.  If  it  is 
felt  that  a  different  learning  fraction  applies  (other  than 
.8)  since  the  base  year,  then  another  straight  line  time  may 
be  used. 

The  variance  of  the  error  term  in  the  estimating  equa¬ 
tions  and  the  narrowness  of  the  80#  confidence  intervals  imply 
that  the  use  of  the  estimating  equation  should  produce  very  ac¬ 
curate  results.  This  is  especially  true  when  the  weight  of  the 
estimated  aircraft  is  within  the  range  of  weights  used  for  this 
study.  One  should  not  expect  such  good  results  when  predic¬ 
ting  for  an  aircraft  that  is  not  in  this  range.  Reasonable 
bounds  do  exist,  however,  for  estimates  when  technology  is 
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being  extended.  If  the  range  of  previously  experienced  high 
and  low  values  for  first  unit  costs  was  used  as  bounds  for 
cost  estimates  whenever  this  manufacturer  extended  technology, 
an  error  of  more  than  15 i*  could  not  have  been  caused  by  the 
imposition  of  these  limits.  That  is  not  to  fay  that  larger 
errors  could  not  happen,  but  any  estimate  that  exceeds  these 
limits  should  be  fully  supported  by  detailed  analysis  and 
viewed  most  critically. 

All  of  the  equations  derived  in  this  study  produce 
estimates  for  the  labor  costs  of  the  first  unit  assuming  that 
there  had  been  no  prototype  produced  in  the  program.  If  a 
prototype  is  planned  for  a  proposed  program,  then  the  estimates 
produced  by  the  equations  in  this  study  must  be  adjusted  down¬ 
ward.  No  procedure  is  offered  here  to  do  this.  Since  the  ad¬ 
justments  used  in  this  study  to  remove  prototype  influences 
from  programs  was  largely  intuitive  and  as  yet  not  proved,  a 
similar  procedure  only  in  reverse  might  be  used  to  introduce 
prototype  influences  on  first  unit  production  costs. 

Air  Force  Use  of  the  Results  of  This  Study 

There  is  one  agency  within  the  Air  Force  that  is 
ideally  suited  for  investigating  the  results  of  this  study 
and  extending  its  usefulness.  Whenever  a  major  project  is 
undertaken,  a  System  Program  Office  (SPO)  is  created  to 
manage  the  effort.  In  the  case  of  airframe  mamifacture,  the 
SPO  has  extensive  contact  with  the  bidding  manufacturers.  It 
should  be  a  relatively  simple  matter  for  the  SPO  to  gather  a 
consistent  set  of  data  from  each  manufacturer  and  determine 
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the  appropriate  form  estimating  equations  should  take  for  the 
type  of  airframe  in  question.  It  also  v/ould  not  be  difficult 
to  examine  the  results  of  several  SPO  analyses  to  see  if  there 
are  industry  wide  applications  of  non-linear  labor  estimating 
equations.  In  fact,  it  could  easily  be  determined  if  non¬ 
linear  forms  are  the  most  appropriate. 

The  fact  that  no  advanced  02*  difficult  mathematical 
techniques  were  used  in  this  analysis  places  the  methodology 
of  this  study  well  within  the  range  of  competence  of  the  SPO. 
The  extensive  knowledge  of  the  SPO  regarding  the  accounting 
systems  and  adjustment  techniques  of  the  various  airframe 
manufacturers  should  allow  them  to  treat  data  in  a  much  nore 
detailed  manner  than  used  here. 
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Appendix  A 


The  Kolomogorov  Smirnov  Test  for  Normality 


The  purpose  of  this  appendix  is  tc  outline  the  compu¬ 
tations  used  in  this  study  to  insure  that  the  error  terms  of 
the  selected  estimating  equations  could  he  assumed  normal. 

The  critical  values  of  the  Koioraogorov-Smimov  statistic  and 
the  computational  procedures  can  be  found  in  Lilliefor's  ar¬ 
ticle,  "On  the  Kolomogorov-Smimov  Test  for  Normality  with 
the  Kean  and  Variance  Unknown,"  in  the  Journal  of  the  Ameri¬ 
can  Statistical  Association,  Vol.  62,  pages  399-402,  1967. 

For  the  construction  of  the  confidence  intervals  used 

in  the  study  it  is  desired  to  assume  that  the  error  terns  are 

distributed  normally  with  mean  zero  and  variance  equal  to  the 

estimated  variance  (S  )  provided  by  the  multiple  regression 

computer  program.  To  make  this  test,  it  is  first  necessary 

to  calculate  the  error  terms  for  each  equation.  Since  there 

♦ 

are  seven  points  included  in  the  regression  analysis,  there 
are  seven  error  terms  in  each  of  the  eight  selected  estimating 
equations.  The  error  terms  are  found  by 


where  is  the  error  term  for  the  ith  aircraft,  Y*  is  the 
value  of  Y  for  the  ith  aircraft  produced  by  the  estimating 
equation,  and  Y^  is  the  actual  value  of  Y  for  aircraft  i. 
The  error  terms  are  next  ordered,  chat  is  arranged  in  order 
of  increasing  magnitude,  and  given  a  new  subscript  equal  to 
their  positions  and  designated 


♦ 
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X 


(i) 


i  =  1  •  •  »7 


The  cumulative  distribution  funct5.on  for  a  normal 

2 

random  variable  with  mean  zero  and  variance  S  will  be  de¬ 
noted  by  Z.  It  is  also  necessary  to  define  the  statistic 
as 


The  Kolomogorov-Smirnov  Statistic  (D*)  is 


D*  =  Max  Di  i»  1-7 


From  the  above  reference,  the  critical  values  for  the 
significance  levels  shown  are 


Significance  Level 

.20 

.10 

.01 

Critical  Values 

CM 

• 

.276 

.348 

As  long  as  D*  is  less 

than  .247,  there 

is  no 

reject  normality.  In  all  eight  equations,  this  condition  was 
satisfied. 

The  calculations  for  equation  (20)  produced  the  lar¬ 
gest  value  of  D*  and  for  that  reason  it  is  tabulated  below. 
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TABLE  XVI 

SUMMARY  OP  THE  XOLOMOGOROV-SMIRNOV  TEST 
FOR  NORMALITY  OP  EQUATION  (20)  S  =  33.26 


i 

x(t)  z<x<ip 

i-1 

“T 

i 

7 

Di 

1 

-43.74 

.093 

.00 

.143 

.093 

2 

-12.63 

.352 

.143 

.286 

.209 

3 

-  6.43 

.425 

.286 

.429 

.139 

4 

-  5.36 

.436 

.429 

.571 

.135 

5 

0.78 

.508 

.571 

.714 

.206 

6 

28.38 

.802 

.714 

.857 

.088 

7 

38.99 

.879 

.857 

1.000 

.121 

Prom 

this  table 

,  it  can  be  seen  that  D* 

=  .209  and  we 

therefore  do 

not  reject 

the  hypothesis 

that  the 

error  terms 

are  distributed  normally.  D* 

for  the 

remaining 

equations  is 

shown  in  the  following  table. 


Equation  Number  21  22  25  26  28  29  30 

D*  .186  .203  .169  .146  .162  .169  .155 
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*  JOfi  MULTIPLE  REGRESS I  ON—AUTOCORRELAT  I  ON 

0  LOG  OPTION 

C  THIS  PROGRAM  IS  WRITTFN  In  KINGSTRAN. 

n i mfnc ion  x ( ?n » i o > « y ; ?o > « 7 ( i o » ?n ) ♦ xx { i o « l o > 
1XY(10)  ,XPARUO)  ,S( 10*10)  ,3F7A(10)  ,UCAP<?0) 

1  DUMMY (20) »DSSP< 10) ,CPD< 10) 

DATA  INPUT#  LOG  CONVERSION  IF  SELECTED  AND 
INPUT  READ  BACK. 

READ*  KSETS 

101  READ*  M*  N,  LOOP*  LN 
INDEX si 

DO  201  I  =  1 *M 
READ*  Y ( I ) «  <X< I  * J) « J=1 «N> 

PUNCH*  YU)*  (XdiJIi  J=1«N) 

I F ( LN ) 201 *201 *227 
227  CONTINUE 

DO  202  J=1 « N 

202  X(I«J)  "  LOGF (XI  I  *  J ) ) 

Y  (  I  )  =  LOGF  (YU  )  ) 

201  CONTINUE 

I F ( LN  >312*312*234 

234  PUNCH  235 

235  FORMAT ( / *  1 4HL0G  REGRESS  1 ON ♦/ > 

GO  TO  312 

004  PUNCH  003 

DO  190  L=IN0EX*M 

190  PUNCH,  Y  {  L. )  »  (X(L*K),  K=  1  *N  ) 

312  CONTINUE 

C  CONVERSION  OF  INPUT  DATA  TO  DEVIATION  FORM. 

DO  10  J=1 «N 
XBAR{ J) =0. 

DO  20  I  - INDEX*M 
20  XRARC J)=XRAR< J)+X(  I  * J) 

10  XBAPC J)=X3AR< J > /FLOAT ( M- I NDEX+1 ) 

PUNCH  301 

301  FORMAT (/* 7HXRAR  (  J ) ) 

PUNCH,  <XBAR(  I  ) *  1  =  1  «N) 

YRAR=0. 

DO  30  I  =  I NPEX  *  M 
30  YBAR=YRAR+Y ( I) 

YBAR=YBAR/FLOAT(M» INDEX+1 ) 

PUNCH  302*  YEAR 

302  FORMAT </*7HYBAR  =  *E15.8) 

DO  50  J  =  1  ♦  is! 

DO  40  I « INDEX, M 
4  0  X(  I  *U)=X(  I  *  J)*-XRAR<  J) 

50  CONTINUE 
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DO  60  I  =  INDEX ,M 
60  Y(n=YU)-VBAR 

CALCULATION  OF  REGRESSION  COEFFICIENTS  AND 
OTHER  STATISTICAL  QUANTITIES. 

00  70  J=1 »N 
DO  HO  I  =  1  ,  M 
80  Z(J«n=X(I«J) 

70  CONTINUE 

CALL  MULT (N,M,N,Z,X,XX) 

PUNCH  SOS 

303  FORMAT  < /,2HXX ) 

PUNCH.  <(XX<I,J),J=1,N),I=1,N> 

CALL  MULT { N  ,  M  ,  1 .Z.Y.XY) 

PUNCH  304 

304  FORMAT I/.2HXY) 

PUNCH.  (XY{ J) *J*1 .N) 

DO  100  Itsl.N 

DO  90  J=1 .N 
90  SCI »J)=XX(I«J) 

100  CONTINUE 

CALL  INVERT  <N,S.MATXX) 

I F  (MATXX )  313.001  .313 
313  PUNCH  Sun 

305  FORMAT (/. 10HXX  INVERSE) 

PUNCH.  ( CSC  I «J) »  J= 1 .N) » I  =  1 »N) 

CALL  MULT  (N.N. 1 ♦  S « X Y . BET A ) 

PUNCH  306 

306  FORMAT ( / « 7H8ETA ( J )  ) 

PUNCH,  (RETAI J) « Jsl ,N) 

SUM=0. 

DO  IIP  I=1,N 

110  SUM=SUM+PETA<I)*XRAR< I > 

8ZER0=YBAR-SUM 
PUNCH  307,  BZERO 

307  FORMAT (/« 12HBETA  ZERO  =  .E15.8) 

YY  =  0. 

DO  203  I = INDEX, M 

203  YY  =  YY  +  Y< I )*Y< I  ) 

PUNCH  204,  YY 

204  FORMAT ( 5HYY  =  ,E1E.Q) 

BXY  =  0. 

DO  205  J=1 , N 

205  BXY  =  BXY  +  XY ( J > *BE7A ( J ) 

PUNCH  206,  BXY 

206  FORMAT ( 6HBXY  =  ,E15*8> 

EE  =  YY-BXY 

EENK  =  EE/FLOAT < M-N- 1 > 

PUNCH  207,  EE 

207  FORMAT <  5HEE  =  ,E15.6> 

PUNCH  208,  EENK 

208  FORMAT (9HEE/N-K  =  ?E15.8> 
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oo  =  ayv/vY 

cor  or  =  fr*«.5 
PUNCH  220,  PP 

?2  '  =-OPvAT(  4HOQ  =  .  ~  1  1  .  8  ) 

punch  pp i ,  cocop 

221  FORMAT* 21 HCOFF  OF  CORRELATION  ?*F11.8) 

PQADJ  =  PP- (FLOAT  {  N) /^LOAT  {  M-N-l  )  )*U  .-PR) 

PUNCH  ?.PP%  RPADJ 

222  FOPMATUnHPP  ADJUSTED  =tp!!»fl) 

PUNCH  224 

224  FORMAT { / \ 2°HCOFF  OF  PARTIAL  DETERMINATION) 

00  223  J=1 .N 

DSSR(J)  =  (  RFT  A  (  J )  **2± )  /S  (  J  ♦  J ) 

CPO(J)  =  DSSR( J)/(DSSR( J)+FF) 

223  PUNCH  230 1  CPD(J) 

230  FORMAT ( F 1 1 #8 ) 

PUNCH  225 

225  FORMAT (/«27HC0EF  OF  PARTIAL  CORRELATION) 

DO  231  J=1 t  N 

231  PUNCH  232 «  CPD( J) ** ,5 

232  FORMAT ( FI  1 .8) 

PUNCH  226 

226  F0RMAT(/«?2HVAR-C0VAP IAMCF  OF  BFTA) 

punch,  < ( cs( i « j)*^-nk) . j=i »n) , 1=1 ,n> 

Punch  21  0,  8XY/  ( EFN<*FLOAT  ( N  )  ) 

210  FORMAT (3HF  =.E15.Q) 

DUNCH  211*  N«  (M-N-l) 

211  FORMAT  (/ ,  34HPAR.'»Vi£TERS  01  THE  F  STATISTIC  ARE  ,15.15) 
PUNCH  006 

IF(LOOP-l)  005,001,001 

C  CALCULATION  OF  AUTOCORRELATION  DATA  IF 

C  fpLc-cTFD, 

005  SAMFAN  =  0« 

DO  120  I=INDFX,M 

Dummy  < i )  =  o. 

00  130  J=1»N 

130  DUMMY(I)  =  8FTA( J) *(X< I , J)+XPAP( J) >  +  DUMMY { I ) 

UCAP(I)  =  Y(  I  H-YPAR-nUMMY<  D-RZERO 
FAMEAN  =  SAMFAN  +  UCAP(I) 

120  PUNCH,  UCAP(l) 

SAMFAN  =  SAMFAN/FLOAT  <  M~ I NDFX+1  ) 

DUNCH  300,  SAMFAN 

SOQ  FOPMAT(/, 1«HMFAN  Oc  YHAT  =  »E15*B) 

SUMSOssO  • 

DO  140  I = I NDFX+1 , M 

1  a  0  FuMFQ=(ur.AP(  I  )-UCAP(  t-1  5  )  +  {  UCAP  (  I  )-UCAP(  1-1  )  >+.SUMSO 
dfnOm=o« 

DO  1F0  I = I NDFX  ,M 

1 50  OENOM=UCAP ( I ) *UCAP ( . ) +OFNOM 
0=?UMS0/DEN0M 
PUNCH  006 
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<ujmso=0. 

DiTNOMrnFNOM-UCAP  t  M  )  »UCAP !  M  > 
no  160  I  =  INDEX+  1  4  M 
1 60  CUMSP=UCAP ! I ) *UCAO i ! - 1 > *3UMSO 
Rr  SUMSO/DENOM 
PUNCH  31 C«  0 

310  F  ORMAT  ( /  «  4HD  =  '.<£15*81 
PUNCH  31 1«  P 

311  FORMAT!  ✓wHR  =  ,£15*8) 

PUNCH  006 

TYPE  310«  D 
TYPE  3 1 1 t  R 
ACCEPT  »L 

IF!  1-L)  102  tOOl  4002 
002  PUNCH  006 

DO  170  1=M» INOEX+I 4-1 

YCI)  a  Y< I J+YBAR-R*! Y(I«1 l+YBAR) 

DO  180  J- 1 ♦ N 

180  X ( I  4 J ) =X(I « JJ+XBARC J)-R*!X! 1-1 1 Jl+XBAR! J  > ) 
170  CONTINUE 

Y1INDEX)  =0. 

DO  181  J=liN 
181  X  H  NDEX  «  J )  =  0 ♦ 

INDEX=INDEX+1 
GO  TO  004 
003  FORMAT!/) 

006  FORMAT  !//) 

001  KSETS  =  KSETS-1 

IF! KSETS ) 209  »  209  4  1 0 1 
209  STOP 
ENC 

C  MATRIX  MULTIPLICATION  SUBROUTINE. 

SUBROUTINE  MULT!M4N4<4A4B4C) 

DIMENSION  A! 10420) 48(204 101 «C! 104 10) 

DO  1  1=1 «M 

DO  1  J=l4K 
SUM=0. 
no  11  L= 1 4N 

11  SUM=SUM+A ! I  4  L ) *S ( L  4  J ) 

1  C ! I  4  J ) =SUM 

RETURN 
END 

C  MATRIX  INVERSION  SUBROUTINE. 

SUBROUTINE  INVERT <N40 4 MATXK) 

DIMENSION  A!  10420)  4  BdOtlO) 

MATXX  =  0 
DO  24  1=1 4N 
DO  24  J= 1 4 N 
24  4 ( I  4 J)=B< I  4 J) 
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V.=N+1 
\'N  =  M+N.' 

OO  ?  1  =  1  «  N 

r»0  ?  J=V  ,mm 

2  S(I«J>=0. 
no  3  1=1 *N 

3  M I «  N+ I)  =  1 • 

DO  15  K=1  «N 
RIOA=AR5(A(K»IO  > 

jj=:< 

DO  6  J=K»N 

IFCARSCAC JiK)  1  —B I GA  )  6  » 6  » 7 

6  CONTINUE 
GO  TO  8 

7  3  I GA=ABS { A { J  «K ) ) 

JJ=J 

GO  TO  6 

8  IF(RIGA)Q» 10.9 

10  PUNCH  300 

300  FORMAT (26H INPUT  MATRIX  A  IS  SINGULAR) 
RFTURN 

9  IFCK-JJ) 12. 1 1 . 12 

12  CO  13  MM=K  <  NN 
TEMP=A(K»MM ) 

A  ( K  »  MM ) = A ( J J  4 MM ) 

13  A ( JJ « MM ) sTFMP 

11  DENO=ACK«K> 

DO  14  KK=K « NN 

14  A ( K . KK ) =  A ( K .  KK ) /DE  NO 
DO  15  JK= 1  *  N 
T£MP=AC JKiK) 

IFC JK-K) 16.15.16 

16  DO  17  I  I =K . NN 

17  A( JK.  I  I  >=A< JK« I  I )-TEMP*ACKl  I  1 ) 

15  CONTINUE 

DO  25  1  =  1  .N 
DO  25  J= 1 . N 
25  B( I . J)=A( I . J+N) 

MATXX  =  l 

RFTURN 

CN0 
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